(54) Title: 31939, A NOVEL HUMAN LEUCINE-RICH REPEAT FAMILY MEMBER AND USES THEREOF

(57) Abstract: The invention provides isolated nucleic acid molecules, designated 31939 nucleic acid molecules, which encode
novel leucine-rich repeat (LRR) members. The invention also provides antisense nucleic acid molecules, recombinant expression
vectors containing 31939 nucleic acid molecules, host cells into which the expression vectors have been introduced, and nonhuman
transgenic animals in which a 31939 gene has been introduced or disrupted. The invention still further provides isolated 31939
proteins, fusion proteins, antigenic peptides and anti-31939 antibodies. Diagnostic methods utilizing compositions of the invention
are also provided.

31939, A NOVEL HUMAN LEUCINE-RICH REPEAT FAMILY MEMBER

AND USES THEREOF

Related Applications

This application claims priority to U.S. provisional application number 60/193,919
filed on March 31, 2000, the contents of which are incorporated herein by reference.

Background of the Invention

Leucine rich repeat-containing proteins are a class of polypeptide molecules with
diverse functions and cellular locations in a variety of organisms (Buchanan et al. (1996)
Prog. Biophys. Molec. Biol. 65:1-44; Kobe et al. (1994) Trends in Biochem. Sci. 19(10):
415-421). Leucine rich repeats (LRRs) are often present in tandem, varying in number
from one, as in, for example, platelet glycoprotein Ibβ, to about 30, as in, e.g., chaoptin
(Kobe, B. and Deisenhofer, J. (1994) supra). Common lengths of LRRs are between 20 and
29 residues.

The three-dimensional architecture of LRRs has been recently characterized based on
the crystal structure of the porcine ribonuclease inhibitor protein (Kobe et al. (1993) Nature
366:751-756). In the ribonuclease inhibitor protein, LRRs correspond to β-α structural
units, consisting of a short β-strand and an α-helix approximately parallel to each other
(Kobe et al. (1994) supra). All repeats, including the terminal segments, adopt very similar
structures, consisting of about 28 or 29 residues, except the amino terminal repeat which
consists of 25 residues. The structural units are arranged so that all the β-strands and the
helices are parallel to a common axis, resulting in a non-globular, horseshoe-shaped
molecule with a curved parallel β-sheet lining the inner circumference of the horseshoe, and
the helices flanking its outer circumference.

LRRs are found in functionally and evolutionarily diverse proteins. LRR-containing
proteins appear to be involved in mediating protein-protein interactions, and at least half of
them participate in signal transduction pathways (Buchanan et al. (1996) supra). The
specificity of the protein-protein interactions of the LRR-containing proteins may result
from the composition of nonconsensus residues, and the length of the repeats and the
flanking domains. LRR-containing molecules can be grouped into several categories,
including: proteins related to ribonuclease inhibitor proteins, adhesive proteins, and signal
transduction receptors (Kobe et al. (1994) supra; Buchanan et al. (1996) supra).

Summary of the Invention

The present invention is based, in part, on the discovery of a novel leucine-rich
repeat (LRR) family member, referred to herein as "31939". The nucleotide sequence of a
cDNA encoding 31939 is shown in SEQ ID NO:1, and the amino acid sequence of a 31939
polypeptide is shown in SEQ ID NO:2. In addition, the nucleotide sequences of the coding
region are depicted in SEQ ID NO:3.
Accordingly, in one aspect, the invention features a nucleic acid molecule that
encodes a 31939 protein or polypeptide, e.g., a biologically active portion of the 31939
protein. In a preferred embodiment, the isolated nucleic acid molecule encodes a
polypeptide having the amino acid sequence of SEQ ID NO:2. In other embodiments, the
invention provides isolated 31939 nucleic acid molecules having the nucleotide sequence
shown in SEQ ID NO:1, SEQ ID NO:3, or the sequence of the DNA insert of the plasmid
deposited with ATCC Accession Number ______. In still other embodiments, the invention
provides nucleic acid molecules that are substantially identical (e.g., naturally occurring
allelic variants) to the nucleotide sequence shown in SEQ ID NO:1, SEQ ID NO:3, or the
sequence of the DNA insert of the plasmid deposited with ATCC Accession Number ______.
In other embodiments, the invention provides a nucleic acid molecule which hybridizes
under a stringency condition described herein to a nucleic acid molecule comprising the
nucleotide sequence of SEQ ID NO:1, SEQ ID NO:3, or the sequence of the DNA insert of
the plasmid deposited with ATCC Accession Number ______, wherein the nucleic acid
encodes a full length 31939 protein or an active fragment thereof.

In a related aspect, the invention further provides nucleic acid constructs that include
a 31939 nucleic acid molecule described herein. In certain embodiments, the nucleic acid
molecules of the invention are operatively linked to native or heterologous regulatory
sequences. Also included are vectors and host cells containing the 31939 nucleic acid
molecules of the invention, e.g., vectors and host cells suitable for producing 31939 nucleic
acid molecules and polypeptides.

In another related aspect, the invention provides nucleic acid fragments suitable as
primers or hybridization probes for the detection of 31939-encoding nucleic acids.

In still another related aspect, isolated nucleic acid molecules that are antisense to a
31939-encoding nucleic acid molecule are provided.

In another aspect, the invention features 31939 polypeptides, and biologically active
or antigenic fragments thereof that are useful, e.g., as reagents or targets in assays applicable
to treatment and diagnosis of 31939-mediated or -related disorders. In another embodiment,
the invention provides 31939 polypeptides having a 31939 activity. Preferred polypeptides
are 31939 proteins including at least one LRR domain, and optionally an immunoglobulin
domain, and an intracellular domain, e.g., an intracellular signaling domain, and, preferably,
having a 31939 activity, e.g., a 31939 activity as described herein.

In other embodiments, the invention provides 31939 polypeptides, e.g., a 31939
polypeptide having the amino acid sequence shown in SEQ ID NO:2 or the amino acid
sequence encoded by the cDNA insert of the plasmid deposited with ATCC Accession
Number ______; an amino acid sequence that is substantially identical to the amino acid
sequence shown in SEQ ID NO:2 or the amino acid sequence encoded by the cDNA insert
of the plasmid deposited with ATCC Accession Number ______; or an amino acid sequence
encoded by a nucleic acid molecule having a nucleotide sequence which hybridizes under a
stringency condition described herein to a nucleic acid molecule comprising the nucleotide
sequence of SEQ ID NO:1, SEQ ID NO:3, or the sequence of the DNA insert of the plasmid
deposited with ATCC Accession Number ______, wherein the nucleic acid encodes a full
length 31939 protein or an active fragment thereof.

In a related aspect, the invention further provides nucleic acid constructs which
include a 31939 nucleic acid molecule described herein.

In a related aspect, the invention provides 31939 polypeptides or fragments
operatively linked to non-31939 polypeptides to form fusion proteins.

In another aspect, the invention features antibodies and antigen-binding fragments thereof,
that react with, or more preferably specifically bind, 31939 polypeptides.

In another aspect, the invention provides methods of screening for compounds that
modulate the expression or activity of the 31939 polypeptides or nucleic acids.

In still another aspect, the invention provides a process for modulating 31939
polypeptide or nucleic acid expression or activity, e.g., using the screened compounds. In
certain embodiments, the methods involve treatment of conditions related to aberrant
activity or expression of the 31939 polypeptides or nucleic acids, such as conditions
involving aberrant or deficient cell proliferation or differentiation, e.g., cancer or a
neuronal disorder.

In yet another aspect, the invention provides methods for inhibiting the proliferation
or migration, or inducing the differentiation or killing, of a 31939-expressing cell. The
method includes contacting the cell with an agent, e.g., a compound (e.g., a compound
identified using the methods described herein) that modulates the activity, or expression, of
the 31939 polypeptide or nucleic acid. In a preferred embodiment, the contacting step is
effected in vitro or ex vivo. In other embodiments, the contacting step is effected in vivo,
e.g., in a subject (e.g., a mammal, e.g., a human), as part of a therapeutic or prophylactic
protocol. In a preferred embodiment, the cell is a hyperproliferative cell, e.g., a cell found in
a solid tumor, a soft tissue tumor, or a metastatic lesion.

In a preferred embodiment, the agent is an inhibitor of a 31939 polypeptide.
Preferably, the inhibitor is chosen from a peptide, a phosphopeptide, a small organic
molecule, a small inorganic molecule and an antibody (e.g., an antibody conjugated to a
therapeutic moiety selected from a cytotoxin, a cytotoxic agent and a radioactive metal ion).
In another preferred embodiment, the agent is an inhibitor of a 31939 nucleic acid, e.g., an
antisense, a ribozyme, or a triple helix molecule.

In a preferred embodiment, the agent is administered in combination with a cytotoxic
agent. Examples of cytotoxic agents include an anti-microtubule agent, a topoisomerase I
inhibitor, a topoisomerase II inhibitor, an anti-metabolite, a mitotic inhibitor, an alkylating
agent, an intercalating agent, an agent capable of interfering with a signal transduction
pathway, an agent that promotes apoptosis or necrosis, and radiation.

In another aspect, the invention features methods for treating or preventing a disorder
characterized by aberrant cellular proliferation, survival, migration or differentiation of a
31939-expressing cell, in a subject. Preferably, the method includes administering to the
subject (e.g., a mammal, e.g., a human) an effective amount of an agent, e.g., a compound
(e.g., a compound identified using the methods described herein) that modulates the activity,
or expression, of the 31939 polypeptide or nucleic acid.

In a preferred embodiment, the disorder is a cancerous or pre-cancerous condition.
In other embodiments, the disorder is a neurological or brain disorder.

In a further aspect, the invention provides methods for evaluating the efficacy of a
treatment of a disorder, e.g., a proliferative disorder or a neuronal disorder. The method
includes: treating a subject, e.g., a patient or an animal, with a protocol under evaluation
(e.g., treating a subject with one or more of: chemotherapy, radiation, and/or a compound
identified using the methods described herein); and evaluating the expression of a 31939
nucleic acid or polypeptide before and after treatment. A change, e.g., a decrease or
increase, in the level of a 31939 nucleic acid (e.g., mRNA) or polypeptide after treatment,
relative to the level of expression before treatment, is indicative of the efficacy of the
treatment of the disorder. The level of 31939 nucleic acid or polypeptide expression can be
detected by any method described herein.

In a preferred embodiment, the evaluating step includes obtaining a sample (e.g., a
tissue sample, e.g., a biopsy, or a fluid sample) from the subject, before and after treatment,
and comparing the level of expression of a 31939 nucleic acid (e.g., mRNA) or polypeptide
before and after treatment.

In another aspect, the invention provides methods for evaluating the efficacy of a
therapeutic or prophylactic agent (e.g., an anti-neoplastic agent). The method includes:
contacting a sample with an agent (e.g., a compound identified using the methods described
herein, a cytotoxic agent) and evaluating the expression of 31939 nucleic acid or
polypeptide in the sample before and after the contacting step. A change, e.g., a decrease or
increase, in the level of 31939 nucleic acid (e.g., mRNA) or polypeptide in the sample
obtained after the contacting step, relative to the level of expression in the sample before the
contacting step, is indicative of the efficacy of the agent. The level of 31939 nucleic acid or
polypeptide expression can be detected by any method described herein. In a preferred
embodiment, the sample includes cells obtained from a cancerous tissue or a neuronal tissue.

The invention also provides assays for determining the activity of or the presence or
absence of 31939 polypeptides or nucleic acid molecules in a biological sample, including
for disease diagnosis.

In a further aspect, the invention provides assays for determining the presence or
absence of a genetic alteration in a 31939 polypeptide or nucleic acid molecule, including
for disease diagnosis.

In another aspect, the invention features a two dimensional array having a plurality of
addresses, each address of the plurality being positionally distinguishable from each other
address of the plurality, and each address of the plurality having a unique capture probe,
e.g., a nucleic acid or peptide sequence. At least one address of the plurality has a capture
probe that recognizes a 31939 molecule. In one embodiment, the capture probe is a nucleic
acid, e.g., a probe complementary to a 31939 nucleic acid sequence. In another
embodiment, the capture probe is a polypeptide, e.g., an antibody specific for 31939
polypeptides. Also featured is a method of analyzing a sample by contacting the sample to
the aforementioned array and detecting binding of the sample to the array.

Other features and advantages of the invention will be apparent from the following
detailed description, and from the claims.

Brief Description of the Drawings

Figures 1A-1C depict a cDNA sequence (SEQ ID NO:1) and predicted amino acid
sequence (SEQ ID NO:2) of human 31939. The methionine-initiated open reading frame of
human 31939 (without the 5' and 3' untranslated regions) starts at nucleotide 187 and goes
to nucleotide 2328 of SEQ ID NO:1 (shown also as coding sequence SEQ ID NO:3).

Figure 2 depicts a hydropathy plot of human 31939. Relative hydrophobic residues
are shown above the dashed horizontal line, and relative hydrophilic residues are below the
dashed horizontal line. The cysteine residues (cys) are indicated by short vertical lines just
below the hydropathy trace. The numbers corresponding to the amino acid sequence of
human 31939 are indicated. Polypeptides of the invention include fragments which include:
all or part of a hydrophobic sequence, i.e., a sequence above the dashed line, e.g., the
sequence from about residue 19 to 38, 570 to 595, and 624 to 644 of SEQ ID NO:2; all or
part of a hydrophilic sequence, i.e., a sequence below the dashed line, e.g., the sequence of
from about residues to 38, 570 to 595, and 624 to 644 of SEQ ID NO:2; a sequence which
includes a Cys, or a glycosylation site.

Figure 3 depicts an alignment of the LRR domains, N-terminal LRR (LRRNT)
domain, and C-terminal LRR (LRRCT) domain of human 31939 with consensus amino acid
sequences derived from Hidden Markov Models (PFAM Accession PF01462; PFAM
Accession PF00560; and PFAM Accession PF01463). The upper sequences are the
consensus amino acid sequences (SEQ ID NO:4 for LRRNT, SEQ ID NO:5 for LRR, and
SEQ ID NO:6 for LRRCT), while the lower amino acid sequence corresponds to amino acids
56 to 85, 87 to 110, 111 to 134, 135 to 158, 159 to 182, 183 to 207, 208 to 229, 230 to 253,
254 to 277, 278 to 301, and 311 to 362 of SEQ ID NO:2.

Figures 4A and 4B depict an alignment of the immunoglobulin (Ig) domain of
human 31939 with a consensus amino acid sequence derived from a Hidden Markov Model
(see e.g., PFAM Accession PF00047 for Figure 4A). The upper sequence is the consensus
amino acid sequence (in Figure 4A, SEQ ID NO:7; in Figure 4B, SEQ ID NO:8 for igvl_8,
SEQ ID NO:9 for igc2_5, and SEQ ID NO:10 for IG_3c), while the lower amino acid
sequence corresponds to amino acids 378 to 438 of SEQ ID NO:2 (Figure 4A), 380 to 438, 376
to 443, and 370 to 454 of SEQ ID NO:2 (Figure 4B alignments, respectively).

Detailed Description

The human 31939 sequence (Figure 1; SEQ ID NO:1), which is approximately 2,493
nucleotides long including untranslated regions, contains a predicted methionine-initiated
coding sequence of about 2142 nucleotides (about nucleotides 187 to 2328 of SEQ ID NO:1,
which correspond to about nucleotides 1 to 2142 of SEQ ID NO:3). The coding sequence
encodes a 713 amino acid protein (SEQ ID NO:2). The human 31939 protein of SEQ ID
NO:2 and Figure 2 includes an amino-terminal hydrophobic amino acid sequence,
consistent with a signal sequence, of about 38 amino acids (from amino acid 1 to about
amino acid 38 of SEQ ID NO:2), which upon cleavage results in the production of a mature
protein form. This mature protein form is approximately 675 amino acid residues in length
(from about amino acid 38 to amino acid 713 of SEQ ID NO:2). Human 31939 contains the
following regions or other structural features:

an N-terminal leucine-rich repeat (LRR) domain (PFAM Accession PF01462)
located at about amino acid residues 56 to 85 of SEQ ID NO:2;

nine LRR domains (PFAM Accession PF00560) located at about amino acid residues
87 to 110, 111 to 134, 135 to 158, 159 to 182, 183 to 207, 208 to 229, 230 to 253, 254 to
277, and 278 to 301 of SEQ ID NO:2;

a C-terminal LRR domain (PFAM Accession PF01463) located at about amino acid
residues 311 to 362 of SEQ ID NO:2;

an immunoglobulin domain (PFAM Accession PF00047) located at about amino
acid residues 378 to 438 of SEQ ID NO:2;

a predicted extracellular domain located at about amino acids 38 to 576 of SEQ ID
NO:2;

a predicted transmembrane domain located at about amino acids 577 to 597 of SEQ
ID NO:2;

a predicted cytoplasmic domain located at about amino acids 598 to 713 of SEQ ID
NO:2;

three predicted glycosaminoglycan attachment sites (PS00002) at about amino acids
466 to 469, 640 to 643 and 679 to 682 of SEQ ID NO:2;

six predicted Casein Kinase II sites (PS00006) located at about amino acid residues
73 to 76, 278 to 281, 427 to 430, 454 to 457, 509 to 512 and 693 to 696 of SEQ ID NO:2;

eight predicted N-glycosylation sites (PS00001) at about amino acids 224 to 227,
283 to 286, 333 to 336, 374 to 377, 400 to 403, 422 to 425, 444 to 447 and 452 to 455 of
SEQ ID NO:2;

nine predicted Protein Kinase C sites (PS00005) at about amino acids 73 to 75, 105
to 107, 183 to 185, 254 to 256, 408 to 410, 509 to 511, 545 to 547, 549 to 551 and 574 to
576 of SEQ ID NO:2; and

sixteen predicted N-myristoylation sites (PS00008) from about amino acid residues 35
to 40, 50 to 55, 128 to 133, 247 to 252, 388 to 393, 401 to 406, 432 to 437, 443 to 448, 462
to 467, 468 to 473, 475 to 480, 520 to 525, 580 to 585, 641 to 646, 680 to 685 and 703 to
708 of SEQ ID NO:2.
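
The coordinates listed above can be cross-checked with a few lines of arithmetic. The
following Python sketch is illustrative only: the variable and function names are
hypothetical, the coordinates are copied from the description, and the reading of the
714th codon as a stop codon is an inference from the stated lengths rather than something
the description states. The extracellular domain (residues 38 to 576) is left out of the
overlap check because it encloses the LRR and immunoglobulin domains by design.

```python
from itertools import combinations

# Coordinates copied from the description (1-based, inclusive).
ORF_START, ORF_END = 187, 2328   # nucleotides of SEQ ID NO:1
PROTEIN_LENGTH = 713             # residues of SEQ ID NO:2

# (name, first residue, last residue) in SEQ ID NO:2; names are illustrative.
FEATURES = [
    ("LRRNT", 56, 85),
    ("LRR1", 87, 110), ("LRR2", 111, 134), ("LRR3", 135, 158),
    ("LRR4", 159, 182), ("LRR5", 183, 207), ("LRR6", 208, 229),
    ("LRR7", 230, 253), ("LRR8", 254, 277), ("LRR9", 278, 301),
    ("LRRCT", 311, 362),
    ("Ig", 378, 438),
    ("transmembrane", 577, 597),
    ("cytoplasmic", 598, 713),
]


def span_length(start, end):
    """Length of an inclusive 1-based interval."""
    return end - start + 1


def codon_count(start, end):
    """Number of complete codons in an open reading frame."""
    length = span_length(start, end)
    if length % 3 != 0:
        raise ValueError("open reading frame is not a multiple of three")
    return length // 3


def overlapping_pairs(features):
    """Names of every pair of features whose intervals share a residue."""
    return [
        (a[0], b[0])
        for a, b in combinations(features, 2)
        if a[1] <= b[2] and b[1] <= a[2]
    ]


if __name__ == "__main__":
    print("ORF length:", span_length(ORF_START, ORF_END))
    # Inference: one codon beyond the 713 residues, i.e. room for a stop codon.
    print("codons:", codon_count(ORF_START, ORF_END))
    print("overlaps:", overlapping_pairs(FEATURES))
    for name, start, end in FEATURES:
        print(f"{name:14s} {start:4d}-{end:<4d} {span_length(start, end):4d} aa")
```

Run as a script, the sketch prints the length of each listed feature, which makes it easy
to compare the individual repeats against the length ranges given later in the description.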

For general information regarding PFAM identifiers, PS prefix and PF prefix domain
identification numbers, refer to Sonnhammer et al. (1997) Proteins 28:405-420 and
http://www.psc.edu/general/software/packages/pfam/pfam.html.

A plasmid containing the nucleotide sequence encoding human 31939 (clone
"Tbh31939FL") was deposited with American Type Culture Collection (ATCC), 10801
University Boulevard, Manassas, VA 20110-2209, on ______ and assigned Accession
Number ______. This deposit will be maintained under the terms of the Budapest Treaty on
the International Recognition of the Deposit of Microorganisms for the Purposes of Patent
Procedure. This deposit was made merely as a convenience for those of skill in the art and
is not an admission that a deposit is required under 35 U.S.C. §112.

The 31939 protein contains a significant number of structural characteristics in
common with members of the LRR family of proteins. The term "family" when referring to
the protein and nucleic acid molecules of the invention means two or more proteins or
nucleic acid molecules having a common structural domain or motif and having sufficient
amino acid or nucleotide sequence homology as defined herein. Such family members can
be naturally or non-naturally occurring and can be from either the same or different species.
For example, a family can contain a first protein of human origin as well as other distinct
proteins of human origin, or alternatively, can contain homologues of non-human origin,
e.g., rat or mouse proteins. Members of a family can also have common functional
characteristics.

A 31939 polypeptide can include a "LRR domain" or regions homologous with a
"LRR domain". As used herein, the term "LRR domain" refers to a protein domain having
an amino acid sequence of about 15 to 50 amino acid residues, and having a bit score for the
alignment of the sequence to the LRR domain profile (Pfam HMM) of at least 6. When the
LRR is not an N-terminal LRR (LRRNT) or C-terminal LRR (LRRCT), it is about [...]
to 30, e.g., about 22 to 24 amino acid residues in length (N-terminal LRR and C-terminal
LRR are discussed below), and can have a bit score for the alignment of the sequence to the
LRR domain profile (Pfam HMM) of at least about 6, 8, 10, or 12. In a preferred
embodiment, a 31939 polypeptide or protein has a "LRR domain" or a region which includes
at least about 15 to 50, more preferably about 20 to 30 or 20 to 25 amino acid residues in
length and has at least about 50%, 60%, 70%, 80%, 90%, 95%, 99%, or 100% homology with
a "LRR domain," e.g., a LRR domain of human 31939 (e.g., a domain within the region of
about residues 56 to 362 of SEQ ID NO:2).

As used herein, the term "LRR region" refers to a polypeptide sequence including a
LRRNT, multiple LRR domains, and a LRRCT. For example, a 31939 polypeptide can
have an LRR region from about amino acids 56 to 362 of SEQ ID NO:2.

An LRR is characterized by a periodic distribution of hydrophobic amino acids,
especially leucine residues, separated by more hydrophilic residues (Buchanan et al. (1996)
Prog. Biophys. Molec. Biol. 65:1-44; Kobe et al. (1994) Trends in Biochem. Sci. 19:415-421,
the contents of which are incorporated herein by reference). Preferably, the LRR
corresponds to a β-α structural unit, consisting of a short β-strand and an α-helix
approximately parallel to each other. The structural units are arranged so that the β-strands
and the helices are parallel to a common axis, resulting in a nonglobular, horseshoe-shaped
molecule with a parallel β-sheet lining the inner circumference of the horseshoe, and the
helices flanking the outer circumference. As shown in Figure 3, the LRR consensus sequence
preferably contains leucines or other aliphatic residues at positions 2, 5, 7, 12, 16, 21 and 24,
and asparagine, cysteine or threonine at position 10. Preferred LRRs contain exclusively
asparagines at position 10 (Figure 3); however, a cysteine residue may be substituted at this
position. Consensus sequences derived from LRRs in individual proteins often contain
additional conserved residues in positions other than those mentioned above. The
hydrophobic consensus residues in the carboxy terminal parts of the repeats are commonly
spaced by 3, 4 or 7 residues. LRRs are usually present in tandem, and the number of LRRs
ranges from one to about 30 repeats.
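
The positional pattern just described can be expressed as a simple check. The Python
sketch below is a rough illustration under stated assumptions, not the profile-based
method used in this description: it treats "leucines or other aliphatic residues" as the
set A, I, L and V (an assumption), tests only the positions named above, and uses a
made-up 24-residue segment that is not taken from SEQ ID NO:2.

```python
# Assumption: "aliphatic" is taken here to mean Ala, Ile, Leu or Val.
ALIPHATIC = set("AILV")
# 1-based positions named in the description above.
ALIPHATIC_POSITIONS = (2, 5, 7, 12, 16, 21, 24)
# Position 10 is described as asparagine, cysteine or threonine.
POSITION_10_ALLOWED = set("NCT")


def lrr_consensus_mismatches(segment):
    """Return the 1-based positions at which a candidate repeat departs
    from the positional pattern; an empty list means every tested
    position matches."""
    segment = segment.upper()
    mismatches = []
    for position in ALIPHATIC_POSITIONS:
        if position > len(segment) or segment[position - 1] not in ALIPHATIC:
            mismatches.append(position)
    if len(segment) < 10 or segment[9] not in POSITION_10_ALLOWED:
        mismatches.append(10)
    return sorted(mismatches)


if __name__ == "__main__":
    # Hypothetical segment built to satisfy the pattern; not a 31939 sequence.
    candidate = "SLTSLELSGNQISTELPSGTLSKL"
    print(lrr_consensus_mismatches(candidate))
```

A check of this kind only flags departures from the named positions; it says nothing about
bit scores, which require alignment to the profile as described below.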

To identify the presence of a "LRR" domain in a 31939 protein sequence, and make
the determination that a polypeptide or protein of interest has a particular profile, the amino
acid sequence of the protein can be searched against the Pfam database of HMMs (e.g., the
Pfam database, release 2.1) using the default parameters
(http://www.sanger.ac.uk/Software/Pfam/HMM_search). For example, the hmmsf program,
which is available as part of the HMMER package of search programs, is a family specific
default program for MILPAT0063 and a score of 15 is the default threshold score for
determining a hit. Alternatively, the threshold score for determining a hit can be lowered
(e.g., to 8 bits). A description of the Pfam database can be found in Sonnhammer et al.
(1997) Proteins 28(3):405-420 and a detailed description of HMMs can be found, for
example, in Gribskov et al. (1990) Meth. Enzymol. 183:146-159; Gribskov et al. (1987) Proc.
Natl. Acad. Sci. USA 84:4355-4358; Krogh et al. (1994) J. Mol. Biol. 235:1501-1531; and
Stultz et al. (1993) Protein Sci. 2:305-314, the contents of which are incorporated herein by
reference. A search was performed against the HMM database resulting in the
identification of multiple "LRR" domains in the amino acid sequence of human 31939 in the
region of about residues 56 to 362 of SEQ ID NO:2 (see Figure 1).
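
The effect of the threshold score on which matches count as hits can be sketched as
follows. This Python fragment is illustrative only: it does not call HMMER or Pfam, the
record type and function names are hypothetical, and the bit scores in the example list
are invented for demonstration (only the residue coordinates come from this description).

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """One profile match; a hypothetical record type for illustration."""
    model: str
    start: int
    end: int
    bit_score: float


DEFAULT_THRESHOLD = 15.0   # default threshold score stated above
RELAXED_THRESHOLD = 8.0    # lowered threshold given above as an example


def filter_hits(hits: List[DomainHit],
                threshold: float = DEFAULT_THRESHOLD) -> List[DomainHit]:
    """Keep only the matches whose bit score meets the threshold."""
    return [hit for hit in hits if hit.bit_score >= threshold]


# Invented scores; they are NOT results reported for human 31939.
EXAMPLE_HITS = [
    DomainHit("LRRNT", 56, 85, 21.0),
    DomainHit("LRR", 87, 110, 12.5),
    DomainHit("LRR", 208, 229, 7.1),
]

if __name__ == "__main__":
    for label, cutoff in (("default", DEFAULT_THRESHOLD),
                          ("relaxed", RELAXED_THRESHOLD)):
        kept = filter_hits(EXAMPLE_HITS, cutoff)
        print(label, [(h.model, h.start, h.end) for h in kept])
```

Lowering the threshold admits weaker matches, which is why short individual repeats may
be reported only under the relaxed setting.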

In some embodiments, a 31939 protein includes an N-terminal LRR (LRRNT)
domain. As used herein, the term "N-terminal LRR" (LRRNT) refers to a domain often
found at the N-terminus of a series of tandem LRRs, having an amino acid sequence of
about 15 to 50 amino acids, and having a bit score for the alignment of the sequence to the
LRRNT domain profile (Pfam HMM) of at least 8. Preferably an LRRNT includes about 15
to 50, more preferably about 20 to 35, e.g., 29 to 34 amino acid residues, and has a bit score
for the alignment of the sequence to the leucine rich-repeat (HMM) of about 10, 15, 20, 30,
or greater. The N-terminal LRR (HMM) has been assigned PFAM Accession PF01462
(http://pfam.wustl.edu). To identify the presence of a "LRRNT" domain in a 31939 protein
sequence, and make the determination that a polypeptide or protein of interest has a
particular profile, the amino acid sequence of the protein can be searched against the Pfam
database of HMMs (e.g., the Pfam database, release 2.1) as described above. A search was
performed against the HMM database resulting in the identification of a "LRRNT" domain
in the amino acid sequence of human 31939 at about amino acid residues 56 to 85 of SEQ
ID NO:2 (see Figures 1 and 3); an alignment of the LRRNT (amino acids 56 to 85) of human
31939 with a consensus amino acid sequence derived from a hidden Markov model is
depicted in Figure 3.

In some embodiments, a 31939 protein includes a C-terminal LRR (LRRCT)
domain. As used herein, the term "C-terminal LRR" (LRRCT) refers to a domain often
found at the C-terminus of a series of tandem LRRs, having an amino acid sequence of
about 25 to 60 amino acids, and having a bit score for the alignment of the sequence to the
LRR domain profile (Pfam HMM) of at least 10. Preferably an LRRCT includes about 30 to
60, more preferably about 40 to 50, and has a bit score for the alignment of the sequence to
the leucine rich-repeat (HMM) of about 10, 15, 20, 25 or greater. The C-terminal LRR
(HMM) has been assigned PFAM Accession PF01463 (http://pfam.wustl.edu/). To identify
the presence of a "LRRCT" domain in a 31939 protein sequence, and make the
determination that a polypeptide or protein of interest has a particular profile, the amino acid
sequence of the protein can be searched against the Pfam database of HMMs (e.g., the Pfam
database, release 2.1), as described above. A search was performed against the HMM
database resulting in the identification of a "LRRCT" domain in the amino acid sequence of
human 31939 at about amino acid residues 311 to 362 of SEQ ID NO:2 (see Figures 1
and 3); an alignment of the LRRCT (amino acids 311 to 362) of human 31939 with a
consensus amino acid sequence derived from a hidden Markov model is depicted in
Figure 3.

In some embodiments, a 31939 protein includes an immunoglobulin domain. As
used herein, an "immunoglobulin domain" (also referred to herein as "Ig") refers to an
amino acid sequence of about 45 to 85 amino acids in length and having a bit score for the
alignment of the sequence to the Ig family profile (Pfam HMM) of at least 15. Preferably,
an immunoglobulin domain is about 50 to 80 amino acids, more preferably about 55 to 65
amino acids in length and has a bit score for the alignment of the sequence to the Ig family
Hidden Markov Model (HMM) of at least 15, 20, or 25. The Ig family HMM has been
assigned the PFAM Accession PF00047. To identify the presence of an Ig domain in a
31939 protein sequence, and make the determination that a polypeptide or protein of interest
has a particular profile, the amino acid sequence of the protein can be searched against the
Pfam database of HMMs (e.g., the Pfam database, release 2.1) using the default parameters
(http://www.sanger.ac.uk/Software/Pfam/HMM_search), as described above. Consensus
amino acid sequences for immunoglobulin domains are shown aligned to an
immunoglobulin domain of a 31939 protein at about residues 378 to 438 of SEQ ID NO:2 in
Figure 4A and to the same region in Figure 4B (consensus sequences, SEQ ID NO:7, 8, 9,
and 10). The consensus sequences in Figure 4B are derived from the SMART database
(Simple Modular Architecture Research Tool, http://smart.embl-heidelberg.de/) of HMMs
as described in Schultz et al. (1998) Proc. Natl. Acad. Sci. USA 95:5857 and Schultz et al.
(2000) Nucl. Acids Res. 28:231. The more conserved residues in the consensus sequence are
indicated by uppercase letters and the less conserved residues in the consensus sequence are
indicated by lowercase letters. Immunoglobulin domains are present in a variety of proteins
(including secreted and membrane-associated proteins). Membrane-associated proteins may
be involved in protein-protein and protein-ligand interactions at the cell surface, and thus
may influence diverse activities including cell surface recognition and/or signal
transduction.

In one embodiment, a 31939 protein includes at least one extracellular domain.
When located at the N-terminal domain the extracellular domain is referred to herein as an
"N-terminal extracellular domain", or as an N-terminal extracellular loop in the amino acid
sequence of the protein. As used herein, an "N-terminal extracellular domain" includes an
amino acid sequence having at least about 100, 200, 300, more preferably at least about 400,
or 500 amino acid residues, and is located outside of a cell or extracellularly. The C-terminal
amino acid residue of an "N-terminal extracellular domain" is adjacent to an N-terminal
amino acid residue of a transmembrane domain in a naturally-occurring 31939, or
31939-like protein. For example, an N-terminal extracellular domain is located at about
amino acid residues 38 to 576 of SEQ ID NO:2. Preferably, the N-terminal extracellular
domain is capable of interacting with (e.g., binding to) an extracellular signal, for example, a
ligand or a cell surface receptor. Most preferably, the N-terminal extracellular domain
mediates a protein-protein interaction, signal transduction and/or cell adhesion. Preferably,
the extracellular domain includes one or more of: at least one leucine-rich repeat; or at least
one immunoglobulin domain.

In a preferred embodiment, a 31939 polypeptide or protein has an "N-terminal 
extracellular domain" or a region which includes at least about 100, preferably at least about 
200, more preferably at least about 300, more preferably at least about 400, and more 
preferably at least about 500 amino acid residues and has at least about 60%, 70%, 80%, 90%, 
95%, 99%, or 100% homology with an "N-terminal extracellular domain," e.g., the 
N-terminal extracellular domain of human 31939 (e.g., residues 38 to 576 of SEQ ID NO:2). 

In another embodiment, a 31939 polypeptide or protein includes at least one 
transmembrane domain. As used herein, the term "transmembrane domain" includes an 
amino acid sequence of about 16 amino acid residues in length which spans the plasma 
membrane. More preferably, a transmembrane domain includes at least about 16, 18, or 20 
amino acid residues and spans the plasma membrane. Transmembrane domains are rich in 
hydrophobic residues, and typically have an α-helical structure. In a preferred embodiment, 
at least 50%, 60%, 70%, 80%, 90%, 95% or more of the amino acids of a transmembrane 
domain are hydrophobic, e.g., alanine, valine, phenylalanine, methionine, leucine, 
isoleucine, tyrosine, or tryptophan. Transmembrane domains are described in, for example, 
http://pfam.wustl.edu/cgi-bin/getdesc?name=7tm-1, and Zagotta W.N. et al. (1996) Annual 
Rev. Neurosci. 19: 235-63, the contents of which are incorporated herein by reference. 
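
The hydrophobicity criterion above can be checked mechanically. The following is a minimal Python sketch that computes the fraction of residues in a candidate segment belonging to the hydrophobic set listed above (alanine, valine, phenylalanine, methionine, leucine, isoleucine, tyrosine, tryptophan). The function name, the one-letter encoding, and the example segment are illustrative assumptions and are not taken from the disclosure or from SEQ ID NO:2.

```python
# Hydrophobic residues named in the text, in one-letter code:
# alanine (A), valine (V), phenylalanine (F), methionine (M),
# leucine (L), isoleucine (I), tyrosine (Y), tryptophan (W).
HYDROPHOBIC = frozenset("AVFMLIYW")


def hydrophobic_fraction(segment: str) -> float:
    """Return the fraction of residues in `segment` that are hydrophobic."""
    residues = segment.upper()
    if not residues:
        raise ValueError("segment must contain at least one residue")
    return sum(r in HYDROPHOBIC for r in residues) / len(residues)


# Hypothetical 20-residue segment, used only to illustrate the calculation.
example = "LLAVFIGLLAVVLWIGSTKR"
fraction = hydrophobic_fraction(example)
meets_50_percent = fraction >= 0.50
```

A segment would satisfy the lowest recited threshold when `hydrophobic_fraction` returns 0.50 or more; the other thresholds (60% through 95%) can be tested the same way.
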
In a preferred embodiment, a 31939 polypeptide or protein has at least one 
transmembrane domain or a region which includes at least 16, 18, or 20 amino acid residues 
and has at least about 60%, 70%, 80%, 90%, 95%, 99%, or 100% homology with a 
"transmembrane domain," e.g., residues at about 577 to 597 of SEQ ID NO:2. Preferably, 
the transmembrane domain transduces a signal, e.g., an extracellular signal across a cell 
membrane, and/or activates a signal transduction pathway. 

In another embodiment, a 31939 protein includes a "C-terminal cytoplasmic 
domain", also referred to herein as a C-terminal cytoplasmic tail, in the sequence of the 
protein. As used herein, a "C-terminal cytoplasmic domain" includes an amino acid 
sequence having a length of at least about 60, preferably at least about 70, more preferably 
at least about 80, and more preferably at least about 90 amino acid residues and is located 
within a cell or within the cytoplasm of a cell. Accordingly, the N-terminal amino acid 
residue of a "C-terminal cytoplasmic domain" is adjacent to a C-terminal amino acid residue 
of a transmembrane domain in a naturally-occurring 31939 or 31939-like protein. For 
example, a C-terminal cytoplasmic domain is found at about amino acid residues 598 to 713 
of SEQ ID NO:2. 

In a preferred embodiment, a 31939 polypeptide or protein has a C-terminal 
cytoplasmic domain or a region which includes about 50 to 170, preferably about 80 to 140, 
preferably about 110 to 120, more preferably about 115 amino acid residues and has at least 
about 60%, 70%, 80%, 90%, 95%, 99%, or 100% homology with a "C-terminal cytoplasmic 
domain," e.g., the C-terminal cytoplasmic domain of human 31939 (e.g., residues 598 to 713 
of SEQ ID NO:2). For example, the C-terminal cytoplasmic domain can transduce a 31939 
signaling activity within a cell. 

In some embodiments, a 31939 protein includes a signal sequence. As used herein, 
the term "signal sequence" means a peptide of about 18 to 80 amino acid residues, which 
occurs at the N-terminus of secreted or integral membrane proteins, and which contains a 
high proportion of hydrophobic amino acid residues. A signal sequence often contains 
about 15 to 70 amino acid residues, and preferably about 20 to 58 amino acid residues, and 
has about 40-90%, preferably about 50-90%, and more preferably about 55-90% 
hydrophobic amino acid residues (e.g., alanine, valine, leucine, isoleucine, phenylalanine, 
tyrosine, tryptophan or proline). Such a signal sequence, also referred to in the art as a 
"signal peptide," functions to direct a protein containing such a sequence into a lipid bilayer. 
For example, in one embodiment, a 31939 protein contains a signal sequence at about 
amino acid residues 1 to 37 of SEQ ID NO:2. The signal sequence is cleaved during 
processing to yield a mature protein. In some embodiments, a mature 31939 protein 
corresponds to amino acids 38 to 713 of SEQ ID NO:2. 
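
The approximate residue ranges recited above for SEQ ID NO:2 (signal sequence, N-terminal extracellular domain, transmembrane domain, C-terminal cytoplasmic domain) tile the 713-residue sequence without gaps. The following Python sketch records those ranges and looks up the domain for a given residue number. The dictionary and function names are illustrative assumptions; the boundaries are the "about" positions stated in the text, not experimentally determined limits.

```python
from typing import Optional

# Approximate domain boundaries recited in the text for SEQ ID NO:2
# (1-based, inclusive residue numbers).
DOMAINS = {
    "signal sequence": (1, 37),
    "N-terminal extracellular domain": (38, 576),
    "transmembrane domain": (577, 597),
    "C-terminal cytoplasmic domain": (598, 713),
}


def domain_of(residue: int) -> Optional[str]:
    """Return the name of the domain containing `residue`, or None."""
    for name, (start, end) in DOMAINS.items():
        if start <= residue <= end:
            return name
    return None


def domain_length(name: str) -> int:
    """Return the number of residues in the named domain."""
    start, end = DOMAINS[name]
    return end - start + 1
```

For instance, `domain_length("transmembrane domain")` gives 21 residues, consistent with the "at least 16, 18, or 20" residues recited for a transmembrane domain.
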

In some embodiments, the 31939 protein includes at least one, two, three, four, five, 
six, seven, or preferably eight N-glycosylation sites; at least one glycosaminoglycan 
attachment site; at least one, two, three, four, five, six, seven, eight, ten, twelve, fifteen or 
preferably sixteen N-myristylation sites; and at least one amidation site. 

As the 31939 polypeptides of the invention may modulate 31939-mediated activities, 
they may be useful as, or for developing, novel diagnostic and therapeutic agents for 
31939-mediated or related disorders, as described below. 

As used herein, a "31939 activity", "biological activity of 31939" or "functional 
activity of 31939", refers to an activity exerted by a 31939 protein, polypeptide or nucleic 
acid molecule. For example, a 31939 activity can be an activity exerted by 31939 in a 
physiological milieu on, e.g., a 31939-responsive cell or on a 31939 substrate, e.g., a protein 
substrate. A 31939 activity can be determined in vivo or in vitro. In one embodiment, a 
31939 activity is a direct activity, such as an association with a 31939 target molecule. A 
"target molecule" or "binding partner" is a molecule with which a 31939 protein binds or 
interacts in nature. In an exemplary embodiment, 31939 is a receptor, e.g., a receptor for an 
extracellular signaling molecule such as a soluble polypeptide hormone or an extracellular 
matrix protein. A 31939 activity can also be an indirect activity, e.g., a cellular signaling 
activity mediated by interaction of the 31939 protein with a 31939 receptor. 

Based on their structural features, the 31939 molecules of the present invention can 
have similar biological activities as LRR family members. For example, the 31939 proteins 
of the present invention can have one or more of the following activities: (1) modulation of 
growth and/or differentiation of a cell; (2) modulation of cell attachment and/or adhesion; 
(3) modulation of cell migration; (4) modulation of embryonic development and/or 
differentiation; (5) regulation of tissue maintenance; and/or (6) modulation of neural 
development, e.g., axonal growth and/or guidance, and/or maintenance. For example, 31939 
proteins may regulate processes including embryonic development and tissue differentiation. 
Examples of such embryonic development and tissue differentiation include neural 
development (such as axonal growth and/or guidance or growth), as well as tissue 
maintenance and function. In addition, 31939 may be involved in pathological conditions 
such as neoplastic transformation, tumor progression, and neuronal degeneration. 

Thus, the 31939 molecules can act as novel diagnostic targets and therapeutic agents 
for controlling a cell proliferative or differentiative disorder, e.g., cancer disorders, or a 
neuronal disorder. 

Examples of cellular proliferative and/or differentiative disorders include cancer, 
e.g., carcinoma, sarcoma, metastatic disorders or hematopoietic neoplastic disorders, e.g., 
leukemias. A metastatic tumor can arise from a multitude of primary tumor types, including 
but not limited to those of prostate, colon, lung, breast and liver origin. 

As used herein, the terms "cancer", "hyperproliferative" and "neoplastic" refer to 
cells having the capacity for autonomous growth. Examples of such cells include cells 
having an abnormal state or condition characterized by rapidly proliferating cell growth. 
Hyperproliferative and neoplastic disease states may be categorized as pathologic, i.e., 
characterizing or constituting a disease state, or may be categorized as non-pathologic, i.e., a 
deviation from normal but not associated with a disease state. The term is meant to include 
all types of cancerous growths or oncogenic processes, metastatic tissues or malignantly 
transformed cells, tissues, or organs, irrespective of histopathologic type or stage of 
invasiveness. "Pathologic hyperproliferative" cells occur in disease states characterized by 
malignant tumor growth. Examples of non-pathologic hyperproliferative cells include 
proliferation of cells associated with wound repair. 

The terms "cancer" or "neoplasms" include malignancies of the various organ 
systems, such as affecting lung, breast, thyroid, lymphoid, gastrointestinal, and 
genito-urinary tract, as well as adenocarcinomas which include malignancies such as most colon 
cancers, renal-cell carcinoma, prostate cancer and/or testicular tumors, non-small cell 
carcinoma of the lung, cancer of the small intestine and cancer of the esophagus. 

The term "carcinoma" is art recognized and refers to malignancies of epithelial or 
endocrine tissues including respiratory system carcinomas, gastrointestinal system 
carcinomas, genitourinary system carcinomas, testicular carcinomas, breast carcinomas, 
prostatic carcinomas, endocrine system carcinomas, and melanomas. Exemplary carcinomas 
include those forming from tissue of the cervix, lung, prostate, breast, head and neck, colon 
and ovary. The term also includes carcinosarcomas, e.g., which include malignant tumors 
composed of carcinomatous and sarcomatous tissues. An "adenocarcinoma" refers to a 
carcinoma derived from glandular tissue or in which the tumor cells form recognizable 
glandular structures. 

The term "sarcoma" is art recognized and refers to malignant tumors of 
mesenchymal derivation. 

Examples of cancers or neoplastic conditions, in addition to the ones described 
above, include, but are not limited to, a fibrosarcoma, myosarcoma, liposarcoma, 
chondrosarcoma, osteogenic sarcoma, chordoma, angiosarcoma, endotheliosarcoma, 
lymphangiosarcoma, lymphangioendotheliosarcoma, synovioma, mesothelioma, Ewing's 
tumor, leiomyosarcoma, rhabdomyosarcoma, gastric cancer, esophageal cancer, rectal 
cancer, pancreatic cancer, ovarian cancer, prostate cancer, uterine cancer, cancer of the head 
and neck, skin cancer, brain cancer, squamous cell carcinoma, sebaceous gland carcinoma, 
papillary carcinoma, papillary adenocarcinoma, cystadenocarcinoma, medullary carcinoma, 
bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile duct carcinoma, 
choriocarcinoma, seminoma, embryonal carcinoma, Wilms' tumor, cervical cancer, 
testicular cancer, small cell lung carcinoma, non-small cell lung carcinoma, bladder 
carcinoma, epithelial carcinoma, glioma, astrocytoma, medulloblastoma, 
craniopharyngioma, ependymoma, pinealoma, hemangioblastoma, acoustic neuroma, 
oligodendroglioma, meningioma, melanoma, neuroblastoma, retinoblastoma, leukemia, 
lymphoma, or Kaposi sarcoma. 

Additional examples of proliferative disorders include hematopoietic neoplastic 
disorders. As used herein, the term "hematopoietic neoplastic disorders" includes diseases 
involving hyperplastic/neoplastic cells of hematopoietic origin. A hematopoietic neoplastic 
disorder can arise from myeloid, lymphoid or erythroid lineages, or precursor cells thereof. 
Preferably, the diseases arise from poorly differentiated acute leukemias, e.g., erythroblastic 
leukemia and acute megakaryoblastic leukemia. Additional exemplary myeloid disorders 
include, but are not limited to, acute promyeloid leukemia (APML), acute myelogenous 
leukemia (AML) and chronic myelogenous leukemia (CML) (reviewed in Vaickus, L. 
(1991) Crit. Rev. in Oncol./Hemotol. 11:267-97); lymphoid malignancies include, but are not 
limited to acute lymphoblastic leukemia (ALL) which includes B-lineage ALL and 
T-lineage ALL, chronic lymphocytic leukemia (CLL), prolymphocytic leukemia (PLL), hairy 
cell leukemia (HLL) and Waldenstrom's macroglobulinemia (WM). Additional forms of 
malignant lymphomas include, but are not limited to non-Hodgkin lymphoma and variants 
thereof, peripheral T cell lymphomas, adult T cell leukemia/lymphoma (ATL), cutaneous 
T-cell lymphoma (CTCL), large granular lymphocytic leukemia (LGF), Hodgkin's disease and 
Reed-Sternberg disease. 

Examples of cellular proliferative and/or differentiative disorders of the breast 
include, but are not limited to, proliferative breast disease including, e.g., epithelial 
hyperplasia, sclerosing adenosis, and small duct papillomas; tumors, e.g., stromal tumors 
such as fibroadenoma, phyllodes tumor, and sarcomas, and epithelial tumors such as large 
duct papilloma; carcinoma of the breast including in situ (noninvasive) carcinoma that 
includes ductal carcinoma in situ (including Paget's disease) and lobular carcinoma in situ, 
and invasive (infiltrating) carcinoma including, but not limited to, invasive ductal 
carcinoma, invasive lobular carcinoma, medullary carcinoma, colloid (mucinous) 
carcinoma, tubular carcinoma, and invasive papillary carcinoma, and miscellaneous 
malignant neoplasms. Disorders in the male breast include, but are not limited to, 
gynecomastia and carcinoma. 

Examples of cellular proliferative and/or differentiative disorders of the lung include, 
but are not limited to, bronchogenic carcinoma, including paraneoplastic syndromes, 
bronchioloalveolar carcinoma, neuroendocrine tumors, such as bronchial carcinoid, 
miscellaneous tumors, and metastatic tumors; pathologies of the pleura, including 
inflammatory pleural effusions, noninflammatory pleural effusions, pneumothorax, and 
pleural tumors, including solitary fibrous tumors (pleural fibroma) and malignant 
mesothelioma. 

Examples of cellular proliferative and/or differentiative disorders of the colon 
include, but are not limited to, non-neoplastic polyps, adenomas, familial syndromes, 
colorectal carcinogenesis, colorectal carcinoma, and carcinoid tumors. 

Examples of cellular proliferative and/or differentiative disorders of the liver include, 
but are not limited to, nodular hyperplasias, adenomas, and malignant tumors, including 
primary carcinoma of the liver and metastatic tumors. 

Examples of cellular proliferative and/or differentiative disorders of the ovary 
include, but are not limited to, ovarian tumors such as tumors of coelomic epithelium, 
serous tumors, mucinous tumors, endometrioid tumors, clear cell adenocarcinoma, 
cystadenofibroma, Brenner tumor, surface epithelial tumors; germ cell tumors such as 
mature (benign) teratomas, monodermal teratomas, immature malignant teratomas, 
dysgerminoma, endodermal sinus tumor, choriocarcinoma; sex cord-stromal tumors such as 
granulosa-theca cell tumors, thecoma-fibromas, androblastomas, hill cell tumors, and 
gonadoblastoma; and metastatic tumors such as Krukenberg tumors. 

Examples of neurological disorders include, but are not limited to disorders 
involving neurons, and disorders involving glia, such as astrocytes, oligodendrocytes, 
ependymal cells, and microglia; cerebral edema, raised intracranial pressure and herniation, 
and hydrocephalus; malformations and developmental diseases, such as neural tube defects, 
forebrain anomalies, posterior fossa anomalies, and syringomyelia and hydromyelia; 
perinatal brain injury; transmissible spongiform encephalopathies (prion diseases); 
demyelinating diseases, including multiple sclerosis, multiple sclerosis variants, acute 
disseminated encephalomyelitis and acute necrotizing hemorrhagic encephalomyelitis, and 
other diseases with demyelination; degenerative diseases, such as degenerative diseases 
affecting the cerebral cortex, including Alzheimer disease and Pick disease, degenerative 
diseases of basal ganglia and brain stem, including Parkinsonism, idiopathic Parkinson 
disease (paralysis agitans), progressive supranuclear palsy, corticobasal degeneration, 
multiple system atrophy, including striatonigral degeneration, Shy-Drager syndrome, and 
olivopontocerebellar atrophy, and Huntington disease; spinocerebellar degenerations, 
including spinocerebellar ataxias, including Friedreich ataxia, and ataxia-telangiectasia, 
degenerative diseases affecting motor neurons, including amyotrophic lateral sclerosis 
(motor neuron disease), bulbospinal atrophy (Kennedy syndrome), and spinal muscular 
atrophy; tumors, such as gliomas, including astrocytoma, including fibrillary (diffuse) 
astrocytoma and glioblastoma multiforme, pilocytic astrocytoma, pleomorphic 
xanthoastrocytoma, and brain stem glioma, oligodendroglioma, and ependymoma and 
related paraventricular mass lesions, neuronal tumors, poorly differentiated neoplasms, 
including medulloblastoma, other parenchymal tumors, including primary brain lymphoma, 
germ cell tumors, and pineal parenchymal tumors, meningiomas, metastatic tumors, 
paraneoplastic syndromes, peripheral nerve sheath tumors, including schwannoma, 
neurofibroma, and malignant peripheral nerve sheath tumor (malignant schwannoma), and 
neurocutaneous syndromes (phakomatoses), including neurofibromatosis, including Type 1 
neurofibromatosis (NF1) and Type 2 neurofibromatosis (NF2), tuberous sclerosis, and Von 
Hippel-Lindau disease. 

The 31939 protein, fragments thereof, and derivatives and other variants of the 
sequence in SEQ ID NO:2 thereof are collectively referred to as "polypeptides or proteins of 
the invention" or "31939 polypeptides or proteins". Nucleic acid molecules encoding such 
polypeptides or proteins are collectively referred to as "nucleic acids of the invention" or 
"31939 nucleic acids." 31939 molecules refer to 31939 nucleic acids, polypeptides, and 
antibodies. 

As used herein, the term "nucleic acid molecule" includes DNA molecules (e.g., a 
cDNA or genomic DNA), RNA molecules (e.g., an mRNA) and analogs of the DNA or 
RNA. A DNA or RNA analog can be synthesized from nucleotide analogs. The nucleic 
acid molecule can be single-stranded or double-stranded, but preferably is double-stranded 
DNA. 

The term "isolated nucleic acid molecule" or "purified nucleic acid molecule" 
includes nucleic acid molecules that are separated from other nucleic acid molecules present 
in the natural source of the nucleic acid. For example, with regards to genomic DNA, the 
term "isolated" includes nucleic acid molecules which are separated from the chromosome 
with which the genomic DNA is naturally associated. Preferably, an "isolated" nucleic acid 
is free of sequences which naturally flank the nucleic acid (i.e., sequences located at the 5' 
and/or 3' ends of the nucleic acid) in the genomic DNA of the organism from which the 
nucleic acid is derived. For example, in various embodiments, the isolated nucleic acid 
molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of 5' 
and/or 3' nucleotide sequences which naturally flank the nucleic acid molecule in genomic 
DNA of the cell from which the nucleic acid is derived. Moreover, an "isolated" nucleic 
acid molecule, such as a cDNA molecule, can be substantially free of other cellular material, 
or culture medium when produced by recombinant techniques, or substantially free of 
chemical precursors or other chemicals when chemically synthesized. 

As used herein, the term "hybridizes under low stringency, medium stringency, high 
stringency, or very high stringency conditions" describes conditions for hybridization and 
washing. Guidance for performing hybridization reactions can be found in Current 
Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6, which is 
incorporated by reference. Aqueous and nonaqueous methods are described in that 
reference and either can be used. Specific hybridization conditions referred to herein are as 
follows: 1) low stringency hybridization conditions in 6X sodium chloride/sodium citrate 
(SSC) at about 45°C, followed by two washes in 0.2X SSC, 0.1% SDS at least at 50°C (the 
temperature of the washes can be increased to 55°C for low stringency conditions); 2) 
medium stringency hybridization conditions in 6X SSC at about 45°C, followed by one or 
more washes in 0.2X SSC, 0.1% SDS at 60°C; 3) high stringency hybridization conditions 
in 6X SSC at about 45°C, followed by one or more washes in 0.2X SSC, 0.1% SDS at 65°C; 
and preferably 4) very high stringency hybridization conditions are 0.5M sodium phosphate, 
7% SDS at 65°C, followed by one or more washes at 0.2X SSC, 1% SDS at 65°C. Very 
high stringency conditions (4) are the preferred conditions and the ones that should be used 
unless otherwise specified. 
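
The four hybridization conditions recited above can be summarized as a small lookup table. The following Python sketch transcribes the buffers and temperatures from the text and treats very high stringency as the default, as the text directs. The class, field, and function names are illustrative assumptions; the number of washes ("two" for low stringency, "one or more" otherwise) and the optional increase of the low stringency wash to 55°C are noted in comments rather than modeled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HybridizationCondition:
    hybridization_buffer: str
    hybridization_temp_c: int  # degrees Celsius ("about" for 6X SSC conditions)
    wash_buffer: str
    wash_temp_c: int  # degrees Celsius


# Values transcribed from the four conditions recited in the text.
# Low stringency: washes are at least at 50 C and may be raised to 55 C.
CONDITIONS = {
    "low": HybridizationCondition("6X SSC", 45, "0.2X SSC, 0.1% SDS", 50),
    "medium": HybridizationCondition("6X SSC", 45, "0.2X SSC, 0.1% SDS", 60),
    "high": HybridizationCondition("6X SSC", 45, "0.2X SSC, 0.1% SDS", 65),
    "very high": HybridizationCondition(
        "0.5M sodium phosphate, 7% SDS", 65, "0.2X SSC, 1% SDS", 65
    ),
}

# The text designates very high stringency as the condition to use
# unless otherwise specified.
DEFAULT_STRINGENCY = "very high"


def get_condition(stringency: str = DEFAULT_STRINGENCY) -> HybridizationCondition:
    """Look up a condition by name, defaulting to very high stringency."""
    return CONDITIONS[stringency]
```
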

Preferably, an isolated nucleic acid molecule of the invention that hybridizes under a 
stringency condition described herein to the sequence of SEQ ID NO:1 or SEQ ID NO:3 
corresponds to a naturally-occurring nucleic acid molecule. 

As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNA or 
DNA molecule having a nucleotide sequence that occurs in nature. For example, a naturally 
occurring nucleic acid molecule can encode a natural protein. 

As used herein, the terms "gene" and "recombinant gene" refer to nucleic acid 
molecules which include at least an open reading frame encoding a 31939 protein. The gene 
can optionally further include non-coding sequences, e.g., regulatory sequences and introns. 
Preferably, a gene encodes a mammalian 31939 protein or derivative thereof. 

An "isolated" or "purified" polypeptide or protein is substantially free of cellular 
material or other contaminating proteins from the cell or tissue source from which the 
protein is derived, or substantially free from chemical precursors or other chemicals when 
chemically synthesized. "Substantially free" means that a preparation of 31939 protein is at 
least 10% pure. In a preferred embodiment, the preparation of 31939 protein has less than 
about 30%, 20%, 10% and more preferably 5% (by dry weight), of non-31939 protein (also 
referred to herein as a "contaminating protein"), or of chemical precursors or non-31939 
chemicals. When the 31939 protein or biologically active portion thereof is recombinantly 
produced, it is also preferably substantially free of culture medium, i.e., culture medium 
represents less than about 20%, more preferably less than about 10%, and most preferably 
less than about 5% of the volume of the protein preparation. The invention includes isolated 
or purified preparations of at least 0.01, 0.1, 1.0, and 10 milligrams in dry weight. 

A "non-essential" amino acid residue is a residue that can be altered from the 
wild-type sequence of 31939 without abolishing or substantially altering a 31939 activity. 
Preferably the alteration does not substantially alter the 31939 activity, e.g., the activity is at 
least 20%, 40%, 60%, 70% or 80% of wild-type. An "essential" amino acid residue is a 
residue that, when altered from the wild-type sequence of 31939, results in abolishing a 
31939 activity such that less than 20% of the wild-type activity is present. For example, 
conserved amino acid residues in 31939 are predicted to be particularly unamenable to 
alteration. 

A "conservative amino acid substitution" is one in which the amino acid residue is 
replaced with an amino acid residue having a similar side chain. Families of amino acid 
residues having similar side chains have been defined in the art. These families include 
amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., 
aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, 
glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, 
leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side 
chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, 
phenylalanine, tryptophan, histidine). Thus, a predicted nonessential amino acid residue in a 
31939 protein is preferably replaced with another amino acid residue from the same side 
chain family. Alternatively, in another embodiment, mutations can be introduced randomly 
along all or part of a 31939 coding sequence, such as by saturation mutagenesis, and the 
resultant mutants can be screened for 31939 biological activity to identify mutants that retain 
activity. Following mutagenesis of SEQ ID NO:1 or SEQ ID NO:3, the encoded protein can 
be expressed recombinantly and the activity of the protein can be determined. 
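
The side chain families listed above lend themselves to a simple membership test. The following Python sketch encodes the six families in one-letter code and reports whether two different residues share at least one family, which is the sense of "conservative" used above. The names `FAMILIES`, `shared_families`, and `is_conservative` are illustrative assumptions, and the sketch makes no prediction about whether a given residue is essential.

```python
from typing import List

# Side chain families as listed in the text (one-letter codes).
# A residue may belong to more than one family.
FAMILIES = {
    "basic": set("KRH"),              # lysine, arginine, histidine
    "acidic": set("DE"),              # aspartic acid, glutamic acid
    "uncharged polar": set("GNQSTYC"),
    "nonpolar": set("AVLIPFMW"),
    "beta-branched": set("TVI"),      # threonine, valine, isoleucine
    "aromatic": set("YFWH"),
}


def shared_families(a: str, b: str) -> List[str]:
    """Return the names of the families that contain both residues."""
    a, b = a.upper(), b.upper()
    return [name for name, members in FAMILIES.items()
            if a in members and b in members]


def is_conservative(a: str, b: str) -> bool:
    """True if two different residues share at least one side chain family."""
    return a.upper() != b.upper() and bool(shared_families(a, b))
```

Under these definitions, replacing lysine with arginine (`is_conservative("K", "R")`) counts as conservative, while replacing lysine with aspartic acid does not.
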

As used herein, a "biologically active portion" of a 31939 protein includes a 
fragment of a 31939 protein which participates in an interaction, e.g., an intramolecular or 
an inter-molecular interaction. An inter-molecular interaction can be a specific binding 
interaction or an enzymatic interaction (e.g., the interaction can be transient and a covalent 
bond is formed or broken). An inter-molecular interaction can be between a 31939 molecule 
and a non-31939 molecule or between a first 31939 molecule and a second 31939 molecule 
(e.g., a dimerization interaction). Biologically active portions of a 31939 protein include 
peptides comprising amino acid sequences sufficiently homologous to or derived from the 
amino acid sequence of the 31939 protein, e.g., the amino acid sequence shown in SEQ ID 
NO:2, which include fewer amino acids than the full-length 31939 proteins, and exhibit at 
least one activity of a 31939 protein. Typically, biologically active portions comprise a 
domain or motif with at least one activity of the 31939 protein, e.g., binding an extracellular 
signaling molecule. A biologically active portion of a 31939 protein can be a polypeptide 
which is, for example, 10, 25, 50, 100, 200 or more amino acids in length. Biologically 
active portions of a 31939 protein can be used as targets for developing agents which 
modulate a 31939-mediated activity, e.g., binding an extracellular signaling molecule. 

Calculations of homology or sequence identity between sequences (the terms are 
used interchangeably herein) are performed as follows. 

To determine the percent identity of two amino acid sequences, or of two nucleic 
acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can 
be introduced in one or both of a first and a second amino acid or nucleic acid sequence for 
optimal alignment and non-homologous sequences can be disregarded for comparison 
purposes). In a preferred embodiment, the length of a reference sequence aligned for 
comparison purposes is at least 30%, preferably at least 40%, more preferably at least 50%, 
60%, and even more preferably at least 70%, 80%, 90%, 100% of the length of the reference 
sequence. The amino acid residues or nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position in the first sequence is occupied 
by the same amino acid residue or nucleotide as the corresponding position in the second 
sequence, then the molecules are identical at that position (as used herein amino acid or 
nucleic acid "identity" is equivalent to amino acid or nucleic acid "homology"). 
The percent identity between the two sequences is a function of the number of identical 
positions shared by the sequences, taking into account the number of gaps, and the length of 
each gap, which need to be introduced for optimal alignment of the two sequences. 
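
As an illustration of the position-by-position comparison described above, the following Python sketch computes percent identity for two sequences that have already been aligned, with gaps written as "-". It uses one common convention, identical positions divided by the total number of aligned columns, so that each gap position lowers the result. This convention, the function name, and the gap character are assumptions made for illustration; the sketch does not perform the alignment and does not reproduce the scoring of any particular program.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over all aligned columns.

    Both inputs must be pre-aligned and of equal length. A column counts
    as identical only when both sequences carry the same non-gap symbol,
    so every gap column reduces the percentage.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment must not be empty")
    identical = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * identical / len(aligned_a)
```

For example, aligning `AC-EFG` against `ACDEFG` gives five identical positions in six columns, and the single gap column is what keeps the value below 100%.
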

The comparison of sequences and determination of percent identity between two 
sequences can be accomplished using a mathematical algorithm. In a preferred 
embodiment, the percent identity between two amino acid sequences is determined using the 
Needleman and Wunsch ((1970) J. Mol. Biol. 48:444-453) algorithm which has been 
incorporated into the GAP program in the GCG software package (available at 
http://www.gcg.com), using either a Blossum 62 matrix or a PAM250 matrix, and a gap 
weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another 
preferred embodiment, the percent identity between two nucleotide sequences is determined 
using the GAP program in the GCG software package (available at http://www.gcg.com), 
using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length 
weight of 1, 2, 3, 4, 5, or 6. A particularly preferred set of parameters (and the one that 
should be used unless otherwise specified) are a Blossum 62 scoring matrix with a gap 
penalty of 12, a gap extend penalty of 4, and a frameshift gap penalty of 5. 

The percent identity between two amino acid or nucleotide sequences can be 
determined using the algorithm of E. Meyers and W. Miller ((1989) CABIOS. 4:11-17) 
which has been incorporated into the ALIGN program (version 2.0), using a PAM120 
weight residue table, a gap length penalty of 12 and a gap penalty of 4. 

The nucleic acid and protein sequences described herein can be used as a "query 
sequence" to perform a search against public databases to, for example, identify other family 
members or related sequences. Such searches can be performed using the NBLAST and 
XBLAST programs (version 2.0) of Altschul et al. (1990) J. Mol. Biol. 215:403-10. 
BLAST nucleotide searches can be performed with the NBLAST program, score = 100, 
wordlength = 12 to obtain nucleotide sequences homologous to 31939 nucleic acid 
molecules of the invention. BLAST protein searches can be performed with the XBLAST 
program, score = 50, wordlength = 3 to obtain amino acid sequences homologous to 31939 
protein molecules of the invention. To obtain gapped alignments for comparison purposes, 
Gapped BLAST can be utilized as described in Altschul et al. (1997) Nucleic Acids Res. 
25:3389-3402. When utilizing BLAST and Gapped BLAST programs, the default 
parameters of the respective programs (e.g., XBLAST and NBLAST) can be used. See 
http://www.ncbi.nlm.nih.gov. 

Particularly preferred 31939 polypeptides of the present invention have an amino 
acid sequence substantially identical to the amino acid sequence of SEQ ID NO:2. In the 
context of an amino acid sequence, the term "substantially identical" is used herein to refer 
to a first amino acid sequence that contains a sufficient or minimum number of amino acid 
residues that are i) identical to, or ii) conservative substitutions of aligned amino acid 
residues in a second amino acid sequence such that the first and second amino acid sequences 
can have a common structural domain and/or common functional activity. For example, amino acid 
sequences that contain a common structural domain having at least about 60%, or 65% 
identity, likely 75% identity, more likely 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 
97%, 98% or 99% identity to SEQ ID NO:2 are termed substantially identical. 

In the context of nucleotide sequence, the term "substantially identical" is used 
herein to refer to a first nucleic acid sequence that contains a sufficient or minimum number 
of nucleotides that are identical to aligned nucleotides in a second nucleic acid sequence 
such that the first and second nucleotide sequences encode a polypeptide having common 
functional activity, or encode a common structural polypeptide domain or a common 
functional polypeptide activity. For example, nucleotide sequences having at least about 
60%, or 65% identity, likely 75% identity, more likely 85%, 90%, 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98% or 99% identity to SEQ ID NO:1 or 3 are termed substantially 
identical. 

"Misexpression or aberrant expression", as used herein, refers to a non-wild type
pattern of gene expression at the RNA or protein level. It includes: expression at non-wild
type levels, i.e., over- or under-expression; a pattern of expression that differs from wild
type in terms of the time or stage at which the gene is expressed, e.g., increased or decreased
expression (as compared with wild type) at a predetermined developmental period or stage;
a pattern of expression that differs from wild type in terms of altered, e.g., increased or
decreased, expression (as compared with wild type) in a predetermined cell type or tissue
type; a pattern of expression that differs from wild type in terms of the splicing size,
translated amino acid sequence, post-translational modification, or biological activity of the
expressed polypeptide; a pattern of expression that differs from wild type in terms of the
effect of an environmental stimulus or extracellular stimulus on expression of the gene, e.g.,
a pattern of increased or decreased expression (as compared with wild type) in the presence
of an increase or decrease in the strength of the stimulus.

"Subject," as used herein, refers to human and non-human animals. The term
"non-human animals" of the invention includes all vertebrates, e.g., mammals, such as
non-human primates (particularly higher primates), sheep, dog, rodent (e.g., mouse or rat),
guinea pig, goat, pig, cat, rabbits, cow, and non-mammals, such as chickens, amphibians,
reptiles, etc. In a preferred embodiment, the subject is a human. In another embodiment,
the subject is an experimental animal or animal suitable as a disease model.

A "purified preparation of cells", as used herein, refers to an in vitro preparation of
cells. In the case of cells from multicellular organisms (e.g., plants and animals), a purified
preparation of cells is a subset of cells obtained from the organism, not the entire intact
organism. In the case of unicellular microorganisms (e.g., cultured cells and microbial
cells), it consists of a preparation of at least 10% and more preferably 50% of the subject
cells.

Various aspects of the invention are described in further detail below.

Isolated Nucleic Acid Molecules

In one aspect, the invention provides an isolated or purified nucleic acid molecule
that encodes a 31939 polypeptide described herein, e.g., a full-length 31939 protein or a
fragment thereof, e.g., a biologically active portion of 31939 protein. Also included is a
nucleic acid fragment suitable for use as a hybridization probe, which can be used, e.g., to
identify a nucleic acid molecule encoding a polypeptide of the invention, 31939 mRNA, and
fragments suitable for use as primers, e.g., PCR primers for the amplification or mutation of
nucleic acid molecules.

In one embodiment, an isolated nucleic acid molecule of the invention includes the
nucleotide sequence shown in SEQ ID NO:1, or a portion of any of these nucleotide
sequences. In one embodiment, the nucleic acid molecule includes sequences encoding the
human 31939 protein (i.e., "the coding region" of SEQ ID NO:1, as shown in SEQ ID
NO:3), as well as 5' untranslated sequences. Alternatively, the nucleic acid molecule can
include only the coding region of SEQ ID NO:1 (e.g., SEQ ID NO:3) and, e.g., no flanking
sequences which normally accompany the subject sequence. In another embodiment, the
nucleic acid molecule encodes a polypeptide selected from the group consisting of a fragment
from about amino acid 56 to 362; a fragment from about 38 to 713; a fragment from about 38
to 576; and a fragment from about 598 to 713 of SEQ ID NO:2.

In another embodiment, an isolated nucleic acid molecule of the invention includes a
nucleic acid molecule which is a complement of the nucleotide sequence shown in SEQ ID
NO:1 or SEQ ID NO:3, or a portion of any of these nucleotide sequences. In other
embodiments, the nucleic acid molecule of the invention is sufficiently complementary to
the nucleotide sequence shown in SEQ ID NO:1 or SEQ ID NO:3, such that it can hybridize
(e.g., under a stringency condition described herein) to the nucleotide sequence shown in
SEQ ID NO:1 or 3, thereby forming a stable duplex.
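
As an illustration of what "complement" and "sufficiently complementary" mean at the sequence level, the following sketch computes a reverse complement and the fraction of Watson-Crick paired positions between two equal-length strands. It does not model stringency conditions or duplex stability, and the function names are hypothetical.

```python
# Watson-Crick pairing for DNA letters (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def fraction_complementary(strand: str, target: str) -> float:
    """Fraction of positions at which `strand` pairs with `target`.

    Both strands are written 5' to 3' and must have equal length; a value of
    1.0 means `strand` is the exact complement of `target`.
    """
    if len(strand) != len(target):
        raise ValueError("strands must have equal length")
    if not strand:
        raise ValueError("strands are empty")
    paired = reverse_complement(strand).upper()
    hits = sum(1 for a, b in zip(paired, target.upper()) if a == b)
    return hits / len(target)
```

Whether a given fraction is enough to form a stable duplex depends on length, base composition, and the hybridization conditions, which this sketch deliberately leaves out.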

In one embodiment, an isolated nucleic acid molecule of the present invention
includes a nucleotide sequence which is at least about 70%, 75%, 80%, 85%, 90%, 91%,
92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more homologous to the entire length of
the nucleotide sequence shown in SEQ ID NO:1 or SEQ ID NO:3, or a portion, preferably
of the same length, of any of these nucleotide sequences.

31939 Nucleic Acid Fragments

A nucleic acid molecule of the invention can include only a portion of the nucleic
acid sequence of SEQ ID NO:1 or 3. For example, such a nucleic acid molecule can include
a fragment which can be used as a probe or primer or a fragment encoding a portion of a
31939 protein, e.g., an immunogenic or biologically active portion of a 31939 protein. A
fragment can comprise, for example, those nucleotides of SEQ ID NO:1 which encode an
LRR domain of human 31939, e.g., an LRR domain found in the region of about residues 56
to 362 of SEQ ID NO:2; those nucleotides of SEQ ID NO:1 which encode an
immunoglobulin domain from about residues 378 to 438 of SEQ ID NO:2; or those
nucleotides of SEQ ID NO:1 which encode an extracellular domain from about residues 38
to 576 of SEQ ID NO:2. The nucleotide sequence determined from the cloning of the 31939
gene allows for the generation of probes and primers designed for use in identifying and/or
cloning other 31939 family members, or fragments thereof, as well as 31939 homologues, or
fragments thereof, from other species.

In another embodiment, a nucleic acid includes a nucleotide sequence that includes
part, or all, of the coding region and extends into either (or both) the 5' or 3' noncoding region.
Other embodiments include a fragment which includes a nucleotide sequence encoding an
amino acid fragment described herein. Nucleic acid fragments can encode a specific domain or
site described herein or fragments thereof, particularly fragments thereof which are at least
about 100, 300, 400, 500, 600, 700, 800, 900, 1000, 1250, 1500, or 2000 amino acids in length.
Fragments also include nucleic acid sequences corresponding to specific amino acid sequences
described above or fragments thereof. Nucleic acid fragments should not be construed as
encompassing those fragments that may have been disclosed prior to the invention.

A nucleic acid fragment can include a sequence corresponding to a domain, region, or
functional site described herein. A nucleic acid fragment can also include one or more domain,
region, or functional site described herein. Thus, for example, a 31939 nucleic acid fragment
can include a sequence corresponding to an LRR domain, an Ig domain, an extracellular domain,
or an intracellular domain.

31939 probes and primers are provided. Typically a probe/primer is an isolated or
purified oligonucleotide. The oligonucleotide typically includes a region of nucleotide
sequence that hybridizes under a stringency condition described herein to at least about 7, 12
or 15, preferably about 20 or 25, more preferably about 30, 35, 40, 45, 50, 55, 60, 65, or 75
consecutive nucleotides of a sense or antisense sequence of SEQ ID NO:1 or SEQ ID NO:3,
or of a naturally occurring allelic variant or mutant of SEQ ID NO:1 or SEQ ID NO:3.

In a preferred embodiment the nucleic acid is a probe which is at least 5 or 10, and
less than 200, more preferably less than 100, or less than 50, base pairs in length. It should
be identical, or differ by 1, or by fewer than 5 or 10 bases, from a sequence disclosed herein.
If alignment is needed for this comparison the sequences should be aligned for maximum
homology. "Looped" out sequences from deletions or insertions, or mismatches, are
considered differences.
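
The length and mismatch limits for a probe can be expressed as a simple check. This is a minimal sketch that assumes the probe and the disclosed sequence segment are already aligned and of equal length, so that only mismatches (not looped-out insertions or deletions) need counting. The default limits are one combination of the values in the text, and the function names are hypothetical.

```python
def count_differences(a: str, b: str) -> int:
    """Number of mismatched positions between two pre-aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    return sum(1 for x, y in zip(a.upper(), b.upper()) if x != y)


def probe_is_acceptable(probe: str, target: str,
                        min_len: int = 10, max_len: int = 200,
                        max_diff: int = 5) -> bool:
    """True if the probe is at least `min_len` and shorter than `max_len` bases
    and differs from `target` at fewer than `max_diff` positions."""
    if not (min_len <= len(probe) < max_len):
        return False
    return count_differences(probe, target) < max_diff
```

Tighter embodiments follow by changing the arguments, for example `max_len=50` or `max_diff=2` for a probe that is identical or differs by a single base.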

A probe or primer can be derived from the sense or anti-sense strand of a nucleic acid
which encodes:

an N-terminal leucine-rich repeat (LRR) domain (PFAM Accession PF01462)
located at about amino acid 56 to 85 of SEQ ID NO:2;

an LRR domain (PFAM Accession PF00560) located at about amino acid 87 to 110,
111 to 134, 135 to 158, 159 to 182, 183 to 207, 208 to 229, 230 to 253, 254 to 277, or 278 to
301 of SEQ ID NO:2;

a C-terminal LRR domain (PFAM Accession PF01463) located at about amino acid
311 to 362 of SEQ ID NO:2;

an LRR region located at about amino acid 56 to 362 of SEQ ID NO:2;

an immunoglobulin domain (PFAM Accession PF00047) located at about amino
acid 378 to 438 of SEQ ID NO:2;

a predicted extracellular domain located at about amino acids 38 to 576 of SEQ ID
NO:2;

a predicted transmembrane domain located at about amino acids 577 to 597 of SEQ
ID NO:2; or

a predicted cytoplasmic domain located at about amino acids 598 to 713 of SEQ ID
NO:2.
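
When designing a probe or primer against one of the domains listed above, the amino acid boundaries have to be converted to nucleotide positions. The sketch below does this for a coding sequence under the assumption that nucleotide 1 is the first base of the codon for residue 1. Whether the numbering of SEQ ID NO:3 follows that convention is not stated here, so the resulting coordinates are illustrative only, and the names `DOMAINS`, `aa_range_to_nt`, and `domain_nt_ranges` are hypothetical.

```python
# Domain boundaries as 1-based, inclusive amino acid positions of SEQ ID NO:2,
# copied from the list above (the nine PF00560 repeats are omitted for brevity).
DOMAINS = {
    "N-terminal LRR": (56, 85),
    "C-terminal LRR": (311, 362),
    "LRR region": (56, 362),
    "immunoglobulin": (378, 438),
    "extracellular": (38, 576),
    "transmembrane": (577, 597),
    "cytoplasmic": (598, 713),
}


def aa_range_to_nt(first_aa: int, last_aa: int) -> tuple[int, int]:
    """Map an inclusive residue range to the inclusive nucleotide range encoding it.

    Assumption: nucleotide 1 of the coding sequence is the first base of codon 1.
    """
    if first_aa < 1 or last_aa < first_aa:
        raise ValueError("invalid residue range")
    return 3 * (first_aa - 1) + 1, 3 * last_aa


def domain_nt_ranges(domains: dict = DOMAINS) -> dict:
    """Nucleotide range for every named domain."""
    return {name: aa_range_to_nt(*span) for name, span in domains.items()}
```

Under that assumption, each residue range of n amino acids maps to 3n nucleotides, so the LRR region of residues 56 to 362 would span 921 nucleotides of coding sequence.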

In another embodiment a set of primers is provided, e.g., primers suitable for use in a
PCR, which can be used to amplify a selected region of a 31939 sequence, e.g., a domain,
region, site or other sequence described herein. The primers should be at least 5, 10, or 50
base pairs in length and less than 100, or less than 200, base pairs in length. The primers
should be identical, or differ by one base, from a sequence disclosed herein or from a
naturally occurring variant. For example, primers suitable for amplifying all or a portion of
any of the following regions are provided: an LRR domain from about amino acid 56 to 362 of
SEQ ID NO:2, or any domain or region described herein.

A nucleic acid fragment can encode an epitope bearing region of a polypeptide
described herein.

A nucleic acid fragment encoding a "biologically active portion of a 31939
polypeptide" can be prepared by isolating a portion of the nucleotide sequence of SEQ ID
NO:1 or 3, which encodes a polypeptide having a 31939 biological activity (e.g., the
biological activities of the 31939 proteins are described herein), expressing the encoded
portion of the 31939 protein (e.g., by recombinant expression in vitro) and assessing the
activity of the encoded portion of the 31939 protein. For example, a nucleic acid fragment
encoding a biologically active portion of 31939 includes an LRR domain, e.g., amino acid
residues about 56 to 362 of SEQ ID NO:2. A nucleic acid fragment encoding a biologically
active portion of a 31939 polypeptide may comprise a nucleotide sequence which is greater
than 300 or more nucleotides in length.

In preferred embodiments, a nucleic acid includes a nucleotide sequence which is
about 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1500, 1750, 2000 or more
nucleotides in length and hybridizes under a stringency condition described herein to a
nucleic acid molecule of SEQ ID NO:1, or SEQ ID NO:3. In one preferred embodiment, a
nucleic acid fragment includes at least one contiguous nucleotide from one of the following
regions: about nucleotides 1 to 151, 1 to 500, 200 to 359, 500 to 800, 500 to 829, 830 to
1500, 1000 to 1500, 1000 to 1543, 1100 to 1500, 2098 to 2493, or 2200 to 2493 of SEQ ID
NO:3. In other embodiments, a nucleic acid fragment includes at least one of the following
regions: about nucleotides 1 to 151, 1 to 500, 200 to 359, 500 to 800, 500 to 829, 830 to
1500, 1000 to 1500, 1000 to 1543, 1100 to 1500, 2098 to 2493, or 2200 to 2493 of SEQ ID
NO:3.

31939 Nucleic Acid Variants 

The invention further encompasses nucleic acid molecules that differ from the
nucleotide sequence shown in SEQ ID NO:1 or SEQ ID NO:3. Such differences can be due
to degeneracy of the genetic code (and result in a nucleic acid which encodes the same
31939 proteins as those encoded by the nucleotide sequence disclosed herein). In another
embodiment, an isolated nucleic acid molecule of the invention has a nucleotide sequence
encoding a protein having an amino acid sequence which differs, by at least 1, but less than
5, 10, 20, 50, or 100 amino acid residues, from that shown in SEQ ID NO:2. If alignment is
needed for this comparison the sequences should be aligned for maximum homology.
"Looped" out sequences from deletions or insertions, or mismatches, are considered
differences.

Nucleic acids of the invention can be chosen for having codons which are preferred,
or non-preferred, for a particular expression system. E.g., the nucleic acid can be one in
which at least one codon, and preferably at least 10%, or 20% of the codons has been altered
such that the sequence is optimized for expression in E. coli, yeast, human, insect, or CHO
cells.
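
Codon alteration of this kind relies on the degeneracy of the genetic code: a codon is swapped for a synonymous one, leaving the encoded protein unchanged. The sketch below recodes a coding sequence using a caller-supplied preference table and reports the fraction of codons altered. The preference table in the usage note is a made-up placeholder, not real codon usage data for any host, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, built in TCAG order ('*' marks a stop codon).
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence whose length is a multiple of three."""
    if len(cds) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def recode(cds: str, preferred: dict) -> tuple[str, float]:
    """Replace codons with the preferred synonymous codon for their amino acid.

    `preferred` maps a one-letter amino acid to the codon to use for it.
    Returns the recoded sequence and the fraction of codons that changed.
    """
    cds = cds.upper()
    if len(cds) % 3 or not cds:
        raise ValueError("coding sequence length must be a non-zero multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    new_codons = []
    for codon in codons:
        choice = preferred.get(CODON_TABLE[codon], codon)
        if CODON_TABLE[choice] != CODON_TABLE[codon]:
            raise ValueError(f"{choice} is not synonymous with {codon}")
        new_codons.append(choice)
    changed = sum(1 for old, new in zip(codons, new_codons) if old != new)
    return "".join(new_codons), changed / len(codons)
```

For example, `recode("ATGTTAAGATAA", {"L": "CTG", "R": "CGT"})` alters two of the four codons while `translate` gives the same protein before and after, which is the property the "at least 10%, or 20% of the codons" language depends on.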

Nucleic acid variants can be naturally occurring, such as allelic variants (same locus),
homologs (different locus), and orthologs (different organism) or can be non-naturally
occurring. Non-naturally occurring variants can be made by mutagenesis techniques, including
those applied to polynucleotides, cells, or organisms. The variants can contain nucleotide
substitutions, deletions, inversions and insertions. Variation can occur in either or both the
coding and non-coding regions. The variations can produce both conservative and
non-conservative amino acid substitutions (as compared in the encoded product).

In a preferred embodiment, the nucleic acid differs from that of SEQ ID NO:1 or 3,
e.g., as follows: by at least one but less than 10, 20, 30, or 40 nucleotides; at least one but less
than 1%, 5%, 10% or 20% of the nucleotides in the subject nucleic acid. If necessary for this
analysis the sequences should be aligned for maximum homology. "Looped" out sequences
from deletions or insertions, or mismatches, are considered differences.

Orthologs, homologs, and allelic variants can be identified using methods known in the
art. These variants comprise a nucleotide sequence encoding a polypeptide that is 50%, at least
about 55%, typically at least about 70-75%, more typically at least about 80-85%, and most
typically at least about 90-95% or more identical to the nucleotide sequence shown in SEQ ID
NO:2 or a fragment of this sequence. Such nucleic acid molecules can readily be identified as
being able to hybridize under a stringency condition described herein, to the nucleotide
sequence shown in SEQ ID NO:2 or a fragment of the sequence. Nucleic acid molecules
corresponding to orthologs, homologs, and allelic variants of the 31939 cDNAs of the
invention can further be isolated by mapping to the same chromosome or locus as the 31939
gene.

Preferred variants include those that are correlated with binding an extracellular
polypeptide, e.g., a signaling molecule.

Allelic variants of 31939, e.g., human 31939, include both functional and
non-functional proteins. Functional allelic variants are naturally occurring amino acid sequence
variants of the 31939 protein within a population that maintain the ability to bind an
extracellular polypeptide, e.g., a signaling molecule. Functional allelic variants will typically
contain only conservative substitution of one or more amino acids of SEQ ID NO:2, or
substitution, deletion or insertion of non-critical residues in non-critical regions of the
protein. Non-functional allelic variants are naturally occurring amino acid sequence
variants of the 31939, e.g., human 31939, protein within a population that do not have the
ability to bind an extracellular polypeptide, e.g., a signaling molecule. Non-functional allelic
variants will typically contain a non-conservative substitution, a deletion, or insertion, or
premature truncation of the amino acid sequence of SEQ ID NO:2, or a substitution,
insertion, or deletion in critical residues or critical regions of the protein.

Moreover, nucleic acid molecules encoding other 31939 family members and, thus,
which have a nucleotide sequence which differs from the 31939 sequences of SEQ ID NO:1
or SEQ ID NO:3 are intended to be within the scope of the invention.

Antisense Nucleic Acid Molecules, Ribozymes and Modified 31939 Nucleic Acid
Molecules

In another aspect, the invention features an isolated nucleic acid molecule which is
antisense to 31939. An "antisense" nucleic acid can include a nucleotide sequence which is
complementary to a "sense" nucleic acid encoding a protein, e.g., complementary to the
coding strand of a double-stranded cDNA molecule or complementary to an mRNA
sequence. The antisense nucleic acid can be complementary to an entire 31939 coding
strand, or to only a portion thereof (e.g., the coding region of human 31939 corresponding to
SEQ ID NO:3). In another embodiment, the antisense nucleic acid molecule is antisense to
a "noncoding region" of the coding strand of a nucleotide sequence encoding 31939 (e.g.,
the 5' and 3' untranslated regions).

An antisense nucleic acid can be designed such that it is complementary to the entire
coding region of 31939 mRNA, but more preferably is an oligonucleotide which is antisense
to only a portion of the coding or noncoding region of 31939 mRNA. For example, the
antisense oligonucleotide can be complementary to the region surrounding the translation
start site of 31939 mRNA, e.g., between the -10 and +10 regions of the target gene
nucleotide sequence of interest. An antisense oligonucleotide can be, for example, about 7,
10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, or more nucleotides in length.
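
A minimal sketch of designing such an oligonucleotide is given below: it takes a window of the transcript around the start codon and returns its reverse complement as a DNA oligonucleotide. The numbering convention is an assumption: the window runs from `upstream` bases before the A of the start codon through `downstream` bases beginning at that A. The function names are hypothetical, and no modified chemistries are modeled.

```python
# Complement table producing a DNA oligonucleotide from DNA or RNA letters.
_COMPLEMENT = str.maketrans("ACGTU", "TGCAA")


def reverse_complement_dna(seq: str) -> str:
    """Reverse complement of a DNA or RNA sequence, written as DNA 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def antisense_around_start(mrna: str, start_index: int,
                           upstream: int = 10, downstream: int = 10) -> str:
    """Antisense oligonucleotide spanning the translation start site.

    `start_index` is the 0-based position of the A of the start codon.
    The window is clipped at the 5' end of the transcript if necessary.
    """
    if not 0 <= start_index < len(mrna):
        raise ValueError("start_index is outside the transcript")
    window = mrna[max(0, start_index - upstream): start_index + downstream]
    return reverse_complement_dna(window)
```

With the default arguments and a transcript that has at least 10 bases on each side of the start position, the result is a 20-mer, which falls within the lengths listed above.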

An antisense nucleic acid of the invention can be constructed using chemical
synthesis and enzymatic ligation reactions using procedures known in the art. For example,
an antisense nucleic acid (e.g., an antisense oligonucleotide) can be chemically synthesized
using naturally occurring nucleotides or variously modified nucleotides designed to increase
the biological stability of the molecules or to increase the physical stability of the duplex
formed between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives and
acridine substituted nucleotides can be used. The antisense nucleic acid also can be
produced biologically using an expression vector into which a nucleic acid has been
subcloned in an antisense orientation (i.e., RNA transcribed from the inserted nucleic acid
will be of an antisense orientation to a target nucleic acid of interest, described further in the
following subsection).

The antisense nucleic acid molecules of the invention are typically administered to a
subject (e.g., by direct injection at a tissue site), or generated in situ such that they hybridize
with or bind to cellular mRNA and/or genomic DNA encoding a 31939 protein to thereby
inhibit expression of the protein, e.g., by inhibiting transcription and/or translation.
Alternatively, antisense nucleic acid molecules can be modified to target selected cells and
then administered systemically. For systemic administration, antisense molecules can be
modified such that they specifically bind to receptors or antigens expressed on a selected
cell surface, e.g., by linking the antisense nucleic acid molecules to peptides or antibodies
which bind to cell surface receptors or antigens. The antisense nucleic acid molecules can
also be delivered to cells using the vectors described herein. To achieve sufficient
intracellular concentrations of the antisense molecules, vector constructs in which the
antisense nucleic acid molecule is placed under the control of a strong pol II or pol III
promoter are preferred.

In yet another embodiment, the antisense nucleic acid molecule of the invention is an
α-anomeric nucleic acid molecule. An α-anomeric nucleic acid molecule forms specific
double-stranded hybrids with complementary RNA in which, contrary to the usual β-units,
the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids Res. 15:6625-
6641). The antisense nucleic acid molecule can also comprise a 2'-o-methylribonucleotide
(Inoue et al. (1987) Nucleic Acids Res. 15:6131-6148) or a chimeric RNA-DNA analogue
(Inoue et al. (1987) FEBS Lett. 215:327-330).

In still another embodiment, an antisense nucleic acid of the invention is a ribozyme.
A ribozyme having specificity for a 31939-encoding nucleic acid can include one or more
sequences complementary to the nucleotide sequence of a 31939 cDNA disclosed herein
(i.e., SEQ ID NO:1 or SEQ ID NO:3), and a sequence having known catalytic sequence
responsible for mRNA cleavage (see U.S. Pat. No. 5,093,246 or Haselhoff and Gerlach
(1988) Nature 334:585-591). For example, a derivative of a Tetrahymena L-19 IVS RNA
can be constructed in which the nucleotide sequence of the active site is complementary to
the nucleotide sequence to be cleaved in a 31939-encoding mRNA. See, e.g., Cech et al.
U.S. Patent No. 4,987,071; and Cech et al. U.S. Patent No. 5,116,742. Alternatively, 31939
mRNA can be used to select a catalytic RNA having a specific ribonuclease activity from a
pool of RNA molecules. See, e.g., Bartel, D. and Szostak, J.W. (1993) Science 261:1411-
1418.

31939 gene expression can be inhibited by targeting nucleotide sequences
complementary to the regulatory region of the 31939 (e.g., the 31939 promoter and/or
enhancers) to form triple helical structures that prevent transcription of the 31939 gene in
target cells. See generally, Helene, C. (1991) Anticancer Drug Des. 6:569-84; Helene, C.
(1992) Ann. N.Y. Acad. Sci. 660:27-36; and Maher, L.J. (1992) Bioassays 14:807-15. The
potential sequences that can be targeted for triple helix formation can be increased by
creating a so-called "switchback" nucleic acid molecule. Switchback molecules are
synthesized in an alternating 5'-3', 3'-5' manner, such that they base pair with first one strand
of a duplex and then the other, eliminating the necessity for a sizeable stretch of either
purines or pyrimidines to be present on one strand of a duplex.

The invention also provides detectably labeled oligonucleotide primer and probe
molecules. Typically, such labels are chemiluminescent, fluorescent, radioactive, or
colorimetric.

A 31939 nucleic acid molecule can be modified at the base moiety, sugar moiety or
phosphate backbone to improve, e.g., the stability, hybridization, or solubility of the
molecule. For non-limiting examples of synthetic oligonucleotides with modifications see
Toulmé (2001) Nature Biotech. 19:17 and Faria et al. (2001) Nature Biotech. 19:40-44.
Such phosphoramidite oligonucleotides can be effective antisense agents.

For example, the deoxyribose phosphate backbone of the nucleic acid molecules can
be modified to generate peptide nucleic acids (see Hyrup B. et al. (1996) Bioorganic &
Medicinal Chemistry 4: 5-23). As used herein, the terms "peptide nucleic acid" or "PNA"
refers to a nucleic acid mimic, e.g., a DNA mimic, in which the deoxyribose phosphate
backbone is replaced by a pseudopeptide backbone and only the four natural nucleobases are
retained. The neutral backbone of a PNA can allow for specific hybridization to DNA and
RNA under conditions of low ionic strength. The synthesis of PNA oligomers can be
performed using standard solid phase peptide synthesis protocols as described in Hyrup B.
et al. (1996) supra and Perry-O'Keefe et al. Proc. Natl. Acad. Sci. 93: 14670-675.

PNAs of 31939 nucleic acid molecules can be used in therapeutic and diagnostic
applications. For example, PNAs can be used as antisense or antigene agents for sequence-
specific modulation of gene expression by, for example, inducing transcription or translation
arrest or inhibiting replication. PNAs of 31939 nucleic acid molecules can also be used in
the analysis of single base pair mutations in a gene (e.g., by PNA-directed PCR clamping);
as "artificial restriction enzymes" when used in combination with other enzymes (e.g., S1
nucleases (Hyrup B. et al. (1996) supra)); or as probes or primers for DNA sequencing or
hybridization (Hyrup B. et al. (1996) supra; Perry-O'Keefe supra).

In other embodiments, the oligonucleotide may include other appended groups such
as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport
across the cell membrane (see, e.g., Letsinger et al. (1989) Proc. Natl. Acad. Sci. USA
86:6553-6556; Lemaitre et al. (1987) Proc. Natl. Acad. Sci. USA 84:648-652; PCT
Publication No. WO88/09810) or the blood-brain barrier (see, e.g., PCT Publication No.
WO89/10134). In addition, oligonucleotides can be modified with hybridization-triggered
cleavage agents (see, e.g., Krol et al. (1988) Bio-Techniques 6:958-976) or intercalating
agents (see, e.g., Zon (1988) Pharm. Res. 5:539-549). To this end, the oligonucleotide
may be conjugated to another molecule (e.g., a peptide, hybridization-triggered cross-
linking agent, transport agent, or hybridization-triggered cleavage agent).

The invention also includes molecular beacon oligonucleotide primer and probe
molecules having at least one region which is complementary to a 31939 nucleic acid of the
invention, two complementary regions one having a fluorophore and one a quencher such
that the molecular beacon is useful for quantitating the presence of the 31939 nucleic acid of
the invention in a sample. Molecular beacon nucleic acids are described, for example, in
Lizardi et al., U.S. Patent No. 5,854,033; Nazarenko et al., U.S. Patent No. 5,866,336, and
Livak et al., U.S. Patent 5,876,930.

Isolated 31939 Polypeptides

In another aspect, the invention features an isolated 31939 protein, or fragment, e.g.,
a biologically active portion, for use as immunogens or antigens to raise or test (or more
generally to bind) anti-31939 antibodies. 31939 protein can be isolated from cells or tissue
sources using standard protein purification techniques. 31939 protein or fragments thereof
can be produced by recombinant DNA techniques or synthesized chemically.

Polypeptides of the invention include those which arise as a result of the existence of
multiple genes, alternative transcription events, alternative RNA splicing events, and
alternative translational and post-translational events. The polypeptide can be expressed in
systems, e.g., cultured cells, which result in substantially the same post-translational
modifications present when the polypeptide is expressed in a native cell, or in
systems which result in the alteration or omission of post-translational modifications, e.g.,
glycosylation or cleavage, present when expressed in a native cell.

In a preferred embodiment, a 31939 polypeptide has one or more of the following
characteristics:

(i) it has the ability to modulate a signal transduction pathway, e.g., to modulate cell
proliferation or differentiation;

(ii) it interacts with an extracellular polypeptide, e.g., a cell signaling molecule or
extracellular matrix polypeptide;

(iii) it has a molecular weight, e.g., a deduced molecular weight, preferably ignoring
any contribution of post-translational modifications, of a 31939 polypeptide, e.g., a
polypeptide of SEQ ID NO:2;

(iv) it has an amino acid composition or other physical characteristics of a
polypeptide having the sequence of SEQ ID NO:2;

(v) it has an overall sequence similarity of at least 60%, preferably at least 70%,
more preferably at least 80, 90, or 95%, with a polypeptide of SEQ ID NO:2;

(vi) it has one or more, preferably nine LRR domains which are preferably about
70%, 80%, 90% or 95% homologous with amino acid residues 56 to 85, 87 to 110, 111 to
134, 135 to 158, 159 to 182, 183 to 207, 208 to 229, 230 to 253, 254 to 277, 278 to 301, or
311 to 362 of SEQ ID NO:2;

(vii) it has an immunoglobulin domain which is preferably about 70%, 80%, 90% or
95% homologous with amino acid residues 378 to 438 of SEQ ID NO:2;

(viii) it has a predicted extracellular domain located at about amino acids 38 to 576
of SEQ ID NO:2;

(ix) it has a predicted transmembrane domain located at about amino acids 577 to
597 of SEQ ID NO:2;

(x) it has a predicted cytoplasmic domain located at about amino acids 598 to 713 of
SEQ ID NO:2; or

(xi) it has at least 5, 9, 10, 12, and preferably 15 of the cysteines found in the amino
acid sequence of the native protein.

In a preferred embodiment the 31939 protein, or fragment thereof, differs from the
corresponding sequence in SEQ ID NO:2. In one embodiment it differs by at least one but by
less than 15, 10 or 5 amino acid residues. In another it differs from the corresponding
sequence in SEQ ID NO:2 by at least one residue but less than 20%, 15%, 10% or 5% of the
residues in it differ from the corresponding sequence in SEQ ID NO:2. (If this comparison
requires alignment the sequences should be aligned for maximum homology. "Looped" out
sequences from deletions or insertions, or mismatches, are considered differences.) The
differences are, preferably, differences or changes at a non-essential residue or a
conservative substitution. In a preferred embodiment the differences are not in the LRR
region. In another preferred embodiment one or more differences are in the LRR region.
Other embodiments include a protein that contains one or more changes in amino acid
sequence, e.g., a change in an amino acid residue which is not essential for activity. Such
31939 proteins differ in amino acid sequence from SEQ ID NO:2, yet retain biological
activity.

In one embodiment, the protein includes an amino acid sequence at least about 60%,
65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or more homologous to SEQ ID NO:2.

A 31939 protein or fragment is provided which varies from the sequence of SEQ ID
NO:2 in regions defined by amino acids about 38 to 576 by at least one but by less than 15,
10 or 5 amino acid residues in the protein or fragment but which does not differ from SEQ
ID NO:2 in regions defined by amino acids about 38 to 576. (If this comparison requires
alignment the sequences should be aligned for maximum homology. "Looped" out
sequences from deletions or insertions, or mismatches, are considered differences.) In some
embodiments the difference is at a non-essential residue or is a conservative substitution,
while in others the difference is at an essential residue or is a non-conservative substitution.

In one embodiment, a biologically active portion of a 31939 protein includes an LRR
domain. Moreover, other biologically active portions, in which other regions of the protein
are deleted, can be prepared by recombinant techniques and evaluated for one or more of the
functional activities of a native 31939 protein.

In a preferred embodiment, the 31939 protein has an amino acid sequence shown in
SEQ ID NO:2. In other embodiments, the 31939 protein is substantially identical to SEQ ID
NO:2. In yet another embodiment, the 31939 protein is substantially identical to SEQ ID
NO:2 and retains the functional activity of the protein of SEQ ID NO:2, as described in
detail in the subsections above.






Chimeric or Fusion Proteins

In another aspect, the invention provides 31939 chimeric or fusion proteins. As used
herein, a 31939 "chimeric protein" or "fusion protein" includes a 31939 polypeptide linked
to a non-31939 polypeptide. A "non-31939 polypeptide" refers to a polypeptide having an
amino acid sequence corresponding to a protein which is not substantially homologous to
the 31939 protein, e.g., a protein which is different from the 31939 protein and which is
derived from the same or a different organism. The 31939 polypeptide of the fusion protein
can correspond to all or a portion, e.g., a fragment described herein, of a 31939 amino acid
sequence. In a preferred embodiment, a 31939 fusion protein includes at least one (or two)
biologically active portion of a 31939 protein. The non-31939 polypeptide can be fused to
the N-terminus or C-terminus of the 31939 polypeptide.

The fusion protein can include a moiety which has a high affinity for a ligand. For
example, the fusion protein can be a GST-31939 fusion protein in which the 31939
sequences are fused to the C-terminus of the GST sequences. Such fusion proteins can
facilitate the purification of recombinant 31939. Alternatively, the fusion protein can be a
31939 protein containing a heterologous signal sequence at its N-terminus. In certain host
cells (e.g., mammalian host cells), expression and/or secretion of 31939 can be increased
through use of a heterologous signal sequence.

Fusion proteins can include all or a part of a serum protein, e.g., an IgG constant
region, or human serum albumin.

The 31939 fusion proteins of the invention can be incorporated into pharmaceutical
compositions and administered to a subject in vivo. The 31939 fusion proteins can be used
to affect the bioavailability of a 31939 substrate. 31939 fusion proteins may be useful
therapeutically for the treatment of disorders caused by, for example, (i) aberrant
modification or mutation of a gene encoding a 31939 protein; (ii) mis-regulation of the
31939 gene; and (iii) aberrant post-translational modification of a 31939 protein.

Moreover, the 31939-fusion proteins of the invention can be used as immunogens to
produce anti-31939 antibodies in a subject, to purify 31939 ligands (see "Anti-31939
Antibodies," below) and in screening assays to identify molecules which inhibit the
interaction of 31939 with a 31939 substrate (see "Screening Assays," below).

Expression vectors are commercially available that already encode a fusion moiety
(e.g., a GST polypeptide). A 31939-encoding nucleic acid can be cloned into such an
expression vector such that the fusion moiety is linked in-frame to the 31939 protein.






Variants of 31939 Proteins

In another aspect, the invention also features a variant of a 31939 polypeptide, e.g.,
which functions as an agonist (mimetics) or as an antagonist. Variants of the 31939 proteins
can be generated by mutagenesis, e.g., discrete point mutation, the insertion or deletion of
sequences or the truncation of a 31939 protein. An agonist of the 31939 proteins can retain
substantially the same, or a subset, of the biological activities of the naturally occurring form
of a 31939 protein. An antagonist of a 31939 protein can inhibit one or more of the
activities of the naturally occurring form of the 31939 protein by, for example, competitively
modulating a 31939-mediated activity of a 31939 protein. Thus, specific biological effects
can be elicited by treatment with a variant of limited function. Preferably, treatment of a
subject with a variant having a subset of the biological activities of the naturally occurring
form of the protein has fewer side effects in a subject relative to treatment with the naturally
occurring form of the 31939 protein.

Variants of a 31939 protein can be identified by screening combinatorial libraries of
mutants, e.g., truncation mutants, of a 31939 protein for agonist or antagonist activity.

Libraries of fragments, e.g., N terminal, C terminal, or internal fragments, of a 31939
protein coding sequence can be used to generate a variegated population of fragments for
screening and subsequent selection of variants of a 31939 protein. Variants in which a
cysteine residue is added or deleted or in which a residue which is glycosylated is added or
deleted are particularly preferred.
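
The idea of a fragment library can be sketched computationally. The example below enumerates N-terminal, C-terminal, and internal fragments of a protein sequence at or above a minimum length. It is only an in-silico illustration under stated assumptions: the function name, the minimum-length parameter, and the toy sequence are hypothetical, and a laboratory library would be built at the nucleic acid level rather than by string slicing.

```python
def fragment_library(protein: str, min_len: int) -> dict:
    """Enumerate proper fragments of `protein` of length >= min_len.

    n_terminal: fragments that keep the first residue (C-terminal truncations)
    c_terminal: fragments that keep the last residue (N-terminal truncations)
    internal:   fragments that touch neither end of the protein
    """
    n = len(protein)
    if min_len < 1 or min_len >= n:
        raise ValueError("min_len must be between 1 and len(protein) - 1")
    library = {"n_terminal": [], "c_terminal": [], "internal": []}
    for length in range(min_len, n):
        library["n_terminal"].append(protein[:length])
        library["c_terminal"].append(protein[n - length:])
        # Internal fragments start after residue 1 and end before the last residue.
        for start in range(1, n - length):
            library["internal"].append(protein[start:start + length])
    return library


if __name__ == "__main__":
    # Toy sequence, not taken from SEQ ID NO:2.
    for kind, fragments in fragment_library("MKLAG", 3).items():
        print(kind, fragments)
```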

Methods for screening gene products of combinatorial libraries made by point
mutations or truncation, and for screening cDNA libraries for gene products having a
selected property are known in the art. Such methods are adaptable for rapid screening of
the gene libraries generated by combinatorial mutagenesis of 31939 proteins. Recursive
ensemble mutagenesis (REM), a new technique which enhances the frequency of functional
mutants in the libraries, can be used in combination with the screening assays to identify
31939 variants (Arkin and Youvan (1992) Proc. Natl. Acad. Sci. USA 89:7811-7815;
Delagrave et al. (1993) Protein Engineering 6:327-331).

Cell based assays can be exploited to analyze a variegated 31939 library. For
example, a library of expression vectors can be transfected into a cell line, e.g., a cell line
which ordinarily responds to 31939 in a substrate-dependent manner. The transfected cells
are then contacted with 31939 and the effect of the expression of the mutant on signaling by
the 31939 substrate can be detected, e.g., by measuring the rate of cell proliferation and the
abundance of cell differentiative markers. Plasmid DNA can then be recovered from the
cells which score for inhibition, or alternatively, potentiation of signaling by the 31939
substrate, and the individual clones further characterized.

In another aspect, the invention features a method of making a 31939 polypeptide,
e.g., a peptide having a non-wild type activity, e.g., an antagonist, agonist, or super agonist
of a naturally occurring 31939 polypeptide.
The method includes: altering the sequence of a 31939 polypeptide, e.g., by substitution
or deletion of one or more residues of a non-conserved
region, a domain or residue disclosed herein, and testing the altered polypeptide for the
desired activity.

In another aspect, the invention features a method of making a fragment or analog of
a 31939 polypeptide having a biological activity of a naturally occurring 31939 polypeptide. The
method includes: altering the sequence, e.g., by substitution or deletion of one or more
residues, of a 31939 polypeptide, e.g., altering the sequence of a non-conserved region, or a
domain or residue described herein, and testing the altered polypeptide for the desired
activity.

Anti-31939 Antibodies 
In another aspect, the invention provides an anti-31939 antibody, or a fragment
thereof (e.g., an antigen-binding fragment thereof). The term "antibody" as used herein
refers to an immunoglobulin molecule or immunologically active portion thereof, i.e., an
antigen-binding portion. As used herein, the term "antibody" refers to a protein comprising
at least one, and preferably two, heavy (H) chain variable regions (abbreviated herein as
VH), and at least one and preferably two light (L) chain variable regions (abbreviated herein
as VL). The VH and VL regions can be further subdivided into regions of hypervariability,
termed "complementarity determining regions" ("CDR"), interspersed with regions that are
more conserved, termed "framework regions" (FR). The extent of the framework region and
CDR's has been precisely defined (see, Kabat, E.A., et al. (1991) Sequences of Proteins of
Immunological Interest, Fifth Edition, U.S. Department of Health and Human Services, NIH
Publication No. 91-3242, and Chothia, C. et al. (1987) J. Mol. Biol. 196:901-917, which are
incorporated herein by reference). Each VH and VL is composed of three CDR's and four
FRs, arranged from amino-terminus to carboxy-terminus in the following order: FR1,
CDR1, FR2, CDR2, FR3, CDR3, FR4.
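
The FR/CDR arrangement described above can be illustrated with a short sketch that splits a variable region sequence into its seven segments, given the positions of the three CDRs. The CDR coordinates are supplied by the caller and are purely hypothetical here; the sketch does not implement the Kabat or Chothia definitions, and the function name and toy sequence are assumptions made for illustration.

```python
REGION_ORDER = ["FR1", "CDR1", "FR2", "CDR2", "FR3", "CDR3", "FR4"]


def split_variable_region(sequence: str, cdr_ranges: list) -> dict:
    """Split a VH or VL sequence into FR1, CDR1, FR2, CDR2, FR3, CDR3, FR4.

    cdr_ranges holds three (start, end) pairs, 1-based and inclusive, listed
    from amino-terminus to carboxy-terminus. Every framework region must be
    non-empty.
    """
    if len(cdr_ranges) != 3:
        raise ValueError("exactly three CDR ranges are required")
    regions = {}
    cursor = 0  # number of residues already assigned to a region
    for index, (start, end) in enumerate(cdr_ranges, start=1):
        if not (cursor + 1 < start <= end < len(sequence)):
            raise ValueError(f"CDR{index} range {start}-{end} is invalid")
        regions[f"FR{index}"] = sequence[cursor:start - 1]
        regions[f"CDR{index}"] = sequence[start - 1:end]
        cursor = end
    regions["FR4"] = sequence[cursor:]
    return regions


if __name__ == "__main__":
    # Toy sequence with made-up CDR positions, for illustration only.
    toy = "AAAABBBCCCCDDDEEEEFFFGGGG"
    for name, segment in split_variable_region(toy, [(5, 7), (12, 14), (19, 21)]).items():
        print(name, segment)
```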

The anti-31939 antibody can further include a heavy and light chain constant region,
to thereby form a heavy and light immunoglobulin chain, respectively. In one embodiment,
the antibody is a tetramer of two heavy immunoglobulin chains and two light
immunoglobulin chains, wherein the heavy and light immunoglobulin chains are
inter-connected by, e.g., disulfide bonds. The heavy chain constant region is comprised of three
domains, CH1, CH2 and CH3. The light chain constant region is comprised of one domain,
CL. The variable region of the heavy and light chains contains a binding domain that
interacts with an antigen. The constant regions of the antibodies typically mediate the
binding of the antibody to host tissues or factors, including various cells of the immune
system (e.g., effector cells) and the first component (C1q) of the classical complement
system.

As used herein, the term "immunoglobulin" refers to a protein consisting of one or
more polypeptides substantially encoded by immunoglobulin genes. The recognized human
immunoglobulin genes include the kappa, lambda, alpha (IgA1 and IgA2), gamma (IgG1,
IgG2, IgG3, IgG4), delta, epsilon and mu constant region genes, as well as the myriad
immunoglobulin variable region genes. Full-length immunoglobulin "light chains" (about
25 KDa or 214 amino acids) are encoded by a variable region gene at the NH2-terminus
(about 110 amino acids) and a kappa or lambda constant region gene at the
COOH-terminus. Full-length immunoglobulin "heavy chains" (about 50 KDa or 446
amino acids), are similarly encoded by a variable region gene (about 116 amino acids) and
one of the other aforementioned constant region genes, e.g., gamma (encoding about 330
amino acids).

The term "antigen-binding fragment" of an antibody (or simply "antibody portion,"
or "fragment"), as used herein, refers to one or more fragments of a full-length antibody that
retain the ability to specifically bind to the antigen, e.g., 31939 polypeptide or fragment
thereof. Examples of antigen-binding fragments of the anti-31939 antibody include, but are
not limited to: (i) a Fab fragment, a monovalent fragment consisting of the VL, VH, CL and
CH1 domains; (ii) a F(ab')2 fragment, a bivalent fragment comprising two Fab fragments
linked by a disulfide bridge at the hinge region; (iii) a Fd fragment consisting of the VH and
CH1 domains; (iv) a Fv fragment consisting of the VL and VH domains of a single arm of
an antibody, (v) a dAb fragment (Ward et al., (1989) Nature 341:544-546), which consists
of a VH domain; and (vi) an isolated complementarity determining region (CDR).
Furthermore, although the two domains of the Fv fragment, VL and VH, are coded for by
separate genes, they can be joined, using recombinant methods, by a synthetic linker that
enables them to be made as a single protein chain in which the VL and VH regions pair to
form monovalent molecules (known as single chain Fv (scFv); see e.g., Bird et al. (1988)
Science 242:423-426; and Huston et al. (1988) Proc. Natl. Acad. Sci. USA 85:5879-5883).
Such single chain antibodies are also encompassed within the term "antigen-binding
fragment" of an antibody. These antibody fragments are obtained using conventional
techniques known to those with skill in the art, and the fragments are screened for utility in
the same manner as are intact antibodies.

The anti-31939 antibody can be a polyclonal or a monoclonal antibody. In other
embodiments, the antibody can be recombinantly produced, e.g., produced by phage display
or by combinatorial methods.

Phage display and combinatorial methods for generating anti-31939 antibodies are
known in the art (as described in, e.g., Ladner et al. U.S. Patent No. 5,223,409; Kang et al.
International Publication No. WO 92/18619; Dower et al. International Publication No. WO
91/17271; Winter et al. International Publication WO 92/20791; Markland et al.
International Publication No. WO 92/15679; Breitling et al. International Publication WO
93/01288; McCafferty et al. International Publication No. WO 92/01047; Garrard et al.
International Publication No. WO 92/09690; Ladner et al. International Publication No. WO
90/02809; Fuchs et al. (1991) Bio/Technology 9:1370-1372; Hay et al. (1992) Hum Antibod
Hybridomas 3:81-85; Huse et al. (1989) Science 246:1275-1281; Griffths et al. (1993)
EMBO J 12:725-734; Hawkins et al. (1992) J Mol Biol 226:889-896; Clackson et al. (1991)
Nature 352:624-628; Gram et al. (1992) PNAS 89:3576-3580; Garrad et al. (1991)
Bio/Technology 9:1373-1377; Hoogenboom et al. (1991) Nuc Acid Res 19:4133-4137; and
Barbas et al. (1991) PNAS 88:7978-7982, the contents of all of which are incorporated by
reference herein).

In one embodiment, the anti-31939 antibody is a fully human antibody (e.g., an
antibody made in a mouse which has been genetically engineered to produce an antibody
from a human immunoglobulin sequence), or a non-human antibody, e.g., a rodent (mouse
or rat), goat, primate (e.g., monkey), camel antibody. Preferably, the non-human antibody is
a rodent (mouse or rat antibody). Methods of producing rodent antibodies are known in the
art.

Human monoclonal antibodies can be generated using transgenic mice carrying the
human immunoglobulin genes rather than the mouse system. Splenocytes from these
transgenic mice immunized with the antigen of interest are used to produce hybridomas that
secrete human mAbs with specific affinities for epitopes from a human protein (see, e.g.,
Wood et al. International Application WO 91/00906, Kucherlapati et al. PCT publication
WO 91/10741; Lonberg et al. International Application WO 92/03918; Kay et al.
International Application 92/03917; Lonberg, N. et al. 1994 Nature 368:856-859; Green,
L.L. et al. 1994 Nature Genet. 7:13-21; Morrison, S.L. et al. 1994 Proc. Natl. Acad. Sci.
USA 81:6851-6855; Bruggeman et al. 1993 Year Immunol 7:33-40; Tuaillon et al. 1993
PNAS 90:3720-3724; Bruggeman et al. 1991 Eur J Immunol 21:1323-1326).

An anti-31939 antibody can be one in which the variable region, or a portion thereof,
e.g., the CDR's, are generated in a non-human organism, e.g., a rat or mouse. Chimeric,
CDR-grafted, and humanized antibodies are within the invention. Antibodies generated in a
non-human organism, e.g., a rat or mouse, and then modified, e.g., in the variable
framework or constant region, to decrease antigenicity in a human are within the invention.

Chimeric antibodies can be produced by recombinant DNA techniques known in the
art. For example, a gene encoding the Fc constant region of a murine (or other species)
monoclonal antibody molecule is digested with restriction enzymes to remove the region
encoding the murine Fc, and the equivalent portion of a gene encoding a human Fc constant
region is substituted (see Robinson et al., International Patent Publication PCT/US86/02269;
Akira, et al., European Patent Application 184,187; Taniguchi, M., European Patent
Application 171,496; Morrison et al., European Patent Application 173,494; Neuberger et
al., International Application WO 86/01533; Cabilly et al. U.S. Patent No. 4,816,567;
Cabilly et al., European Patent Application 125,023; Better et al. (1988 Science
240:1041-1043); Liu et al. (1987) PNAS 84:3439-3443; Liu et al., 1987, J. Immunol. 139:3521-3526;
Sun et al. (1987) PNAS 84:214-218; Nishimura et al., 1987, Canc. Res. 47:999-1005; Wood
et al. (1985) Nature 314:446-449; and Shaw et al., 1988, J. Natl Cancer Inst. 80:1553-1559).

A humanized or CDR-grafted antibody will have at least one or two but generally all three
recipient CDR's (of heavy and or light immunoglobulin chains) replaced with a donor CDR.
The antibody may be replaced with at least a portion of a non-human CDR or only some of
the CDR's may be replaced with non-human CDR's. It is only necessary to replace the
number of CDR's required for binding of the humanized antibody to a 31939 or a fragment
thereof. Preferably, the donor will be a rodent antibody, e.g., a rat or mouse antibody, and
the recipient will be a human framework or a human consensus framework. Typically, the
immunoglobulin providing the CDR's is called the "donor" and the immunoglobulin
providing the framework is called the "acceptor." In one embodiment, the donor
immunoglobulin is a non-human (e.g., rodent). The acceptor framework is a
naturally-occurring (e.g., a human) framework or a consensus framework, or a sequence about 85% or
higher, preferably 90%, 95%, 99% or higher identical thereto.

As used herein, the term "consensus sequence" refers to the sequence formed from the
most frequently occurring amino acids (or nucleotides) in a family of related sequences (See
e.g., Winnaker, From Genes to Clones (Verlagsgesellschaft, Weinheim, Germany 1987)). In a
family of proteins, each position in the consensus sequence is occupied by the amino acid
occurring most frequently at that position in the family. If two amino acids occur equally
frequently, either can be included in the consensus sequence. A "consensus framework" refers
to the framework region in the consensus immunoglobulin sequence.
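
The definition of a consensus sequence translates directly into a small computation. The sketch below takes a family of pre-aligned, equal-length sequences and picks the most frequent residue at each position. Because the text allows either residue when two occur equally often, the alphabetical tie-break used here is an arbitrary assumption, as are the function name and the toy sequences.

```python
from collections import Counter


def consensus_sequence(aligned_family: list) -> str:
    """Return the most frequent residue at each position of an aligned family.

    All sequences must have the same length. Ties are broken alphabetically,
    which is an arbitrary choice; any of the tied residues would be valid.
    """
    if not aligned_family:
        raise ValueError("at least one sequence is required")
    length = len(aligned_family[0])
    if any(len(seq) != length for seq in aligned_family):
        raise ValueError("all sequences must have the same length")
    consensus = []
    for column in zip(*aligned_family):
        counts = Counter(column)
        top = max(counts.values())
        tied = sorted(res for res, count in counts.items() if count == top)
        consensus.append(tied[0])
    return "".join(consensus)


if __name__ == "__main__":
    # Toy family, for illustration only.
    print(consensus_sequence(["MKLA", "MKIA", "MRLA"]))  # prints MKLA
```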

An antibody can be humanized by methods known in the art. Humanized antibodies
can be generated by replacing sequences of the Fv variable region which are not directly
involved in antigen binding with equivalent sequences from human Fv variable regions.
General methods for generating humanized antibodies are provided by Morrison, S. L.,
1985, Science 229:1202-1207, by Oi et al., 1986, BioTechniques 4:214, and by Queen et al.
US 5,585,089, US 5,693,761 and US 5,693,762, the contents of all of which are hereby
incorporated by reference. Those methods include isolating, manipulating, and expressing
the nucleic acid sequences that encode all or part of immunoglobulin Fv variable regions
from at least one of a heavy or light chain. Sources of such nucleic acid are well known to
those skilled in the art and, for example, may be obtained from a hybridoma producing an
antibody against a 31939 polypeptide or fragment thereof. The recombinant DNA encoding
the humanized antibody, or fragment thereof, can then be cloned into an appropriate
expression vector.

Humanized or CDR-grafted antibodies can be produced by CDR-grafting or CDR
substitution, wherein one, two, or all CDR's of an immunoglobulin chain can be replaced.
See e.g., U.S. Patent 5,225,539; Jones et al. 1986 Nature 321:552-525; Verhoeyan et al.
1988 Science 239:1534; Beidler et al. 1988 J. Immunol. 141:4053-4060; Winter US
5,225,539, the contents of all of which are hereby expressly incorporated by reference.
Winter describes a CDR-grafting method which may be used to prepare the humanized
antibodies of the present invention (UK Patent Application GB 2188638A, filed on March
26, 1987; Winter US 5,225,539), the contents of which is expressly incorporated by
reference.

Also within the scope of the invention are humanized antibodies in which specific
amino acids have been substituted, deleted or added. Preferred humanized antibodies have
amino acid substitutions in the framework region, such as to improve binding to the antigen.
For example, a humanized antibody will have framework residues identical to the donor
framework residue or to another amino acid other than the recipient framework residue. To
generate such antibodies, a selected, small number of acceptor framework residues of the
humanized immunoglobulin chain can be replaced by the corresponding donor amino acids.
Preferred locations of the substitutions include amino acid residues adjacent to the CDR, or
which are capable of interacting with a CDR (see e.g., US 5,585,089). Criteria for selecting
amino acids from the donor are described in US 5,585,089, e.g., columns 12-16 of US
5,585,089, the contents of which are hereby
incorporated by reference. Other techniques for humanizing antibodies are described in
Padlan et al. EP 519596 A1, published on December 23, 1992.

In preferred embodiments an antibody can be made by immunizing with purified
31939 antigen, or a fragment thereof, e.g., a fragment described herein, membrane
associated antigen, tissue, e.g., crude tissue preparations, whole cells, preferably living cells,
lysed cells, or cell fractions, e.g., membrane fractions.

A full-length 31939 protein or antigenic peptide fragment of 31939 can be used as
an immunogen or can be used to identify anti-31939 antibodies made with other
immunogens, e.g., cells, membrane preparations, and the like. The antigenic peptide of
31939 should include at least 8 amino acid residues of the amino acid sequence shown in
SEQ ID NO:2 and encompasses an epitope of 31939. Preferably, the antigenic peptide
includes at least 10 amino acid residues, more preferably at least 15 amino acid residues,
even more preferably at least 20 amino acid residues, and most preferably at least 30 amino
acid residues.

Fragments of 31939 which include residues about 19 to 38, 570 to 595, and 624 to
644 of SEQ ID NO:2 can be used to make, e.g., used as immunogens or used to characterize
the specificity of an antibody, antibodies against hydrophilic regions of the 31939 protein.
Similarly, fragments of 31939 which include residues about 19 to 38, 570 to 595, and 624 to
644 of SEQ ID NO:2 can be used to make an antibody against a hydrophobic region of the
31939 protein; fragments of 31939 which include residues about 38 to 576, about 56 to 362,
or about 378 to 438 of SEQ ID NO:2 can be used to make an antibody against an
extracellular region of the 31939 protein; a fragment of 31939 which includes residues about
598 to 713 of SEQ ID NO:2 can be used to make an antibody against an intracellular region
of the 31939 protein; a fragment of 31939 which includes residues about 56 to 362 of SEQ
ID NO:2 can be used to make an antibody against the LRR region of the 31939 protein.

Antibodies reactive with, or specific for, any of these regions, or other regions or
domains described herein are provided.

Antibodies which bind only native 31939 protein, only denatured or otherwise
non-native 31939 protein, or which bind both, are within the invention. Antibodies with linear
or conformational epitopes are within the invention. Conformational epitopes can
sometimes be identified by identifying antibodies that bind to native but not denatured
31939 protein.

Preferred epitopes encompassed by the antigenic peptide are regions of 31939 that are
located on the surface of the protein, e.g., hydrophilic regions, as well as regions with high
antigenicity. For example, an Emini surface probability analysis of the human 31939
protein sequence can be used to indicate the regions that have a particularly high probability
of being localized to the surface of the 31939 protein and are thus likely to constitute surface
residues useful for targeting antibody production.
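
A sliding-window scan gives a feel for how hydrophilic, and therefore probably surface-exposed, stretches can be located in a sequence. The sketch below is not the Emini surface probability analysis named above; it substitutes the widely used Kyte-Doolittle hydropathy scale, on which more negative window averages indicate more hydrophilic stretches. The window size, function names, and toy sequence are assumptions chosen for illustration.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic, negative = hydrophilic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def window_hydropathy(sequence: str, window: int = 7) -> list:
    """Mean hydropathy of every window of the given size along the sequence."""
    if window < 1 or window > len(sequence):
        raise ValueError("window must be between 1 and the sequence length")
    return [
        sum(KYTE_DOOLITTLE[res] for res in sequence[i:i + window]) / window
        for i in range(len(sequence) - window + 1)
    ]


def most_hydrophilic_window(sequence: str, window: int = 7) -> tuple:
    """Return (start, end, score) of the most hydrophilic window, 1-based inclusive."""
    scores = window_hydropathy(sequence, window)
    best = min(range(len(scores)), key=scores.__getitem__)
    return best + 1, best + window, scores[best]


if __name__ == "__main__":
    # Toy sequence: a run of arginines flanked by isoleucines.
    print(most_hydrophilic_window("IIIIIIIRRRRRRRIIIIIII"))
```

Candidate windows found this way would still need to be checked experimentally, since hydropathy is only a proxy for surface exposure and antigenicity.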

In a preferred embodiment the antibody can bind to the extracellular portion of the
31939 protein, e.g., it can bind to a whole cell which expresses the 31939 protein. In
another embodiment, the antibody binds an intracellular portion of the 31939 protein. In
preferred embodiments antibodies can bind one or more of purified antigen, membrane
associated antigen, tissue, e.g., tissue sections, whole cells, preferably living cells, lysed
cells, cell fractions, e.g., membrane fractions.

The anti-31939 antibody can be a single chain antibody. A single-chain antibody
(scFV) may be engineered (see, for example, Colcher, D. et al. (1999) Ann N Y Acad Sci
880:263-80; and Reiter, Y. (1996) Clin Cancer Res 2:245-52). The single chain antibody
can be dimerized or multimerized to generate multivalent antibodies having specificities for
different epitopes of the same target 31939 protein.

In a preferred embodiment the antibody has effector function and can fix
complement. In other embodiments the antibody does not recruit effector cells or fix
complement.

In a preferred embodiment, the antibody has reduced or no ability to bind an Fc
receptor. For example, it is an isotype or subtype, fragment or other mutant, which does not
support binding to an Fc receptor, e.g., it has a mutagenized or deleted Fc receptor binding
region.

In a preferred embodiment, an anti-31939 antibody alters (e.g., increases or
decreases) the extracellular ligand binding activity of a 31939 polypeptide. For example, the
antibody can bind at or in proximity to the active site, e.g., to an epitope that includes a
residue located from about 56 to 362 of SEQ ID NO:2.

The antibody can be coupled to a toxin, e.g., a polypeptide toxin, e.g., ricin or
diphtheria toxin or active fragment thereof, or a radioactive nucleus, or imaging agent, e.g., a
radioactive, enzymatic, or other, e.g., imaging agent, e.g., a NMR contrast agent. Labels
which produce detectable radioactive emissions or fluorescence are preferred.

An anti-31939 antibody (e.g., monoclonal antibody) can be used to isolate 31939 by
standard techniques, such as affinity chromatography or immunoprecipitation. Moreover, an
anti-31939 antibody can be used to detect 31939 protein (e.g., in a cellular lysate or cell
supernatant) in order to evaluate the abundance and pattern of expression of the protein.
Anti-31939 antibodies can be used diagnostically to monitor protein levels in tissue as part
of a clinical testing procedure, e.g., to determine the efficacy of a given treatment regimen.
Detection can be facilitated by coupling (i.e., physically linking) the antibody to a detectable
substance (i.e., antibody labelling). Examples of detectable substances include various
enzymes, prosthetic groups, fluorescent materials, luminescent materials, bioluminescent
materials, and radioactive materials. Examples of suitable enzymes include horseradish
peroxidase, alkaline phosphatase, β-galactosidase, or acetylcholinesterase; examples of
suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples
of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein
isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or
phycoerythrin; an example of a luminescent material includes luminol; examples of
bioluminescent materials include luciferase, luciferin, and aequorin, and examples of
suitable radioactive material include '^'l, or ^ 

The invention also includes a nucleic acid which encodes an anti-31939 antibody,
e.g., an anti-31939 antibody described herein. Also included are vectors which include the
nucleic acid and cells transformed with the nucleic acid, particularly cells which are useful
for producing an antibody, e.g., mammalian cells, e.g. CHO or lymphatic cells.

The invention also includes cell lines, e.g., hybridomas, which make an anti-31939
antibody, e.g., an antibody described herein, and method of using said cells to make a
31939 antibody.

Recombinant Expression Vectors, Host Cells and Genetically Engineered Cells

In another aspect, the invention includes vectors, preferably expression vectors,
containing a nucleic acid encoding a polypeptide described herein. As used herein, the term
"vector" refers to a nucleic acid molecule capable of transporting another nucleic acid to
which it has been linked and can include a plasmid, cosmid or viral vector. The vector can
be capable of autonomous replication or it can integrate into a host DNA. Viral vectors
include, e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses.

A vector can include a 31939 nucleic acid in a form suitable for expression of the
nucleic acid in a host cell. Preferably the recombinant expression vector includes one or
more regulatory sequences operatively linked to the nucleic acid sequence to be expressed.
The term "regulatory sequence" includes promoters, enhancers and other expression control
elements (e.g., polyadenylation signals). Regulatory sequences include those which direct
constitutive expression of a nucleotide sequence, as well as tissue-specific regulatory and/or
inducible sequences. The design of the expression vector can depend on such factors as the
choice of the host cell to be transformed, the level of expression of protein desired, and the
like. The expression vectors of the invention can be introduced into host cells to thereby
produce proteins or polypeptides, including fusion proteins or polypeptides, encoded by
nucleic acids as described herein (e.g., 31939 proteins, mutant forms of 31939 proteins,
fusion proteins, and the like).

The recombinant expression vectors of the invention can be designed for expression
of 31939 proteins in prokaryotic or eukaryotic cells. For example, polypeptides of the
invention can be expressed in E. coli, insect cells (e.g., using baculovirus expression
vectors), yeast cells or mammalian cells. Suitable host cells are discussed further in
Goeddel, (1990) Gene Expression Technology: Methods in Enzymology 185, Academic
Press, San Diego, CA. Alternatively, the recombinant expression vector can be transcribed
and translated in vitro, for example using T7 promoter regulatory sequences and T7
polymerase.

Expression of proteins in prokaryotes is most often carried out in E. coli with vectors
containing constitutive or inducible promoters directing the expression of either fusion or
non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded
therein, usually to the amino terminus of the recombinant protein. Such fusion vectors
typically serve three purposes: 1) to increase expression of recombinant protein; 2) to
increase the solubility of the recombinant protein; and 3) to aid in the purification of the
recombinant protein by acting as a ligand in affinity purification. Often, a proteolytic
cleavage site is introduced at the junction of the fusion moiety and the recombinant protein
to enable separation of the recombinant protein from the fusion moiety subsequent to
purification of the fusion protein. Such enzymes, and their cognate recognition sequences,
include Factor Xa, thrombin and enterokinase. Typical fusion expression vectors include
pGEX (Pharmacia Biotech Inc; Smith, D.B. and Johnson, K.S. (1988) Gene 67:31-40),
pMAL (New England Biolabs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ) which
fuse glutathione S-transferase (GST), maltose E binding protein, or protein A, respectively,
to the target recombinant protein.
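
The role of the cleavage site at the fusion junction can be illustrated with a toy model of a fusion construct. The sketch below joins a tag, a protease recognition motif, and a target sequence, then splits the fusion at the position where the protease would cut. The motifs used are the commonly cited recognition sequences for Factor Xa (IEGR), enterokinase (DDDDK) and thrombin (LVPR followed by GS); the tag and target strings are hypothetical placeholders, not GST or SEQ ID NO:2, and real proteases are less strictly specific than this string search suggests.

```python
# Recognition motif and the number of motif residues that stay on the tag side
# after cleavage. Thrombin cuts between R and G, leaving "GS" on the target.
CLEAVAGE_SITES = {
    "factor_xa": ("IEGR", 4),
    "enterokinase": ("DDDDK", 5),
    "thrombin": ("LVPRGS", 4),
}


def build_fusion(tag: str, protease: str, target: str) -> str:
    """Join tag, protease recognition motif, and target into one sequence."""
    motif, _ = CLEAVAGE_SITES[protease]
    return tag + motif + target


def cleave(fusion: str, protease: str) -> tuple:
    """Split a fusion sequence at the first occurrence of the protease motif."""
    motif, offset = CLEAVAGE_SITES[protease]
    index = fusion.find(motif)
    if index == -1:
        raise ValueError(f"no {protease} site found")
    cut = index + offset
    return fusion[:cut], fusion[cut:]


if __name__ == "__main__":
    # Placeholder tag and target sequences, for illustration only.
    fusion = build_fusion("MSPILG", "factor_xa", "MKLLAG")
    print(cleave(fusion, "factor_xa"))
```

Because the search stops at the first match, a motif occurring by chance inside the tag or target would be cut there instead, which mirrors the practical reason for checking a target sequence for internal sites before choosing a protease.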

Purified fusion proteins can be used in 31939 activity assays (e.g., direct assays or
competitive assays described in detail below), or to generate antibodies specific for 31939
proteins. In a preferred embodiment, a fusion protein expressed in a retroviral expression
vector of the present invention can be used to infect bone marrow cells which are
subsequently transplanted into irradiated recipients. The pathology of the subject recipient
is then examined after sufficient time has passed (e.g., six weeks).

One strategy to maximize recombinant protein expression in E. coli is to express the protein
in a host bacterium with an impaired capacity to proteolytically cleave the recombinant protein
(Gottesman, S., (1990) Gene Expression Technology: Methods in Enzymology 185,
Academic Press, San Diego, California 119-128). Another strategy is to alter the nucleic
acid sequence of the nucleic acid to be inserted into an expression vector so that the
individual codons for each amino acid are those preferentially utilized in E. coli (Wada et
al., (1992) Nucleic Acids Res. 20:2111-2118). Such alteration of nucleic acid sequences of
the invention can be carried out by standard DNA synthesis techniques.
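
The codon-replacement strategy can be illustrated with a short sketch. This is only an illustration under stated assumptions, not the procedure of the cited reference: the PREFERRED table is a small, partial set of codon choices picked for the example (a real codon-usage table for the intended host covers every amino acid), and the function name recode is hypothetical.

```python
# Minimal sketch: back-translate a peptide using one preferred codon per residue.
# PREFERRED is an illustrative, partial table (an assumption for this example);
# substitute a complete codon-usage table for the intended host before real use.
PREFERRED = {
    "M": "ATG",  # methionine (only one codon exists)
    "L": "CTG",  # leucine
    "K": "AAA",  # lysine
    "E": "GAA",  # glutamate
    "R": "CGT",  # arginine
    "*": "TAA",  # stop
}


def recode(peptide: str) -> str:
    """Return a coding sequence that uses the listed codon for each residue."""
    missing = sorted(set(peptide) - set(PREFERRED))
    if missing:
        raise KeyError("no preferred codon listed for: " + ", ".join(missing))
    return "".join(PREFERRED[residue] for residue in peptide)


if __name__ == "__main__":
    print(recode("MLKER*"))
```

The recoded sequence encodes the same peptide; only the synonymous codon choice changes, which is why the alteration can be made by ordinary DNA synthesis.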

The 31939 expression vector can be a yeast expression vector, a vector for
expression in insect cells, e.g., a baculovirus expression vector or a vector suitable for
expression in mammalian cells.

When used in mammalian cells, the expression vector's control functions can be
provided by viral regulatory elements. For example, commonly used promoters are derived
from polyoma, Adenovirus 2, cytomegalovirus and Simian Virus 40.

In another embodiment, the promoter is an inducible promoter, e.g., a promoter
regulated by a steroid hormone, by a polypeptide hormone (e.g., by means of a signal
transduction pathway), or by a heterologous polypeptide (e.g., the tetracycline-inducible
systems, "Tet-On" and "Tet-Off"; see, e.g., Clontech Inc., CA, Gossen and Bujard (1992)
Proc. Natl. Acad. Sci. USA 89:5547, and Paillard (1989) Human Gene Therapy 9:983).

In another embodiment, the recombinant mammalian expression vector is capable of
directing expression of the nucleic acid preferentially in a particular cell type (e.g.,
tissue-specific regulatory elements are used to express the nucleic acid). Non-limiting examples of
suitable tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al.
(1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988) Adv.
Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and Baltimore
(1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell 33:729-740;
Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters (e.g., the
neurofilament promoter; Byrne and Ruddle (1989) Proc. Natl. Acad. Sci. USA
86:5473-5477), pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916), and
mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 4,873,316
and European Application Publication No. 264,166). Developmentally-regulated promoters
are also encompassed, for example, the murine hox promoters (Kessel and Gruss (1990)
Science 249:374-379) and the α-fetoprotein promoter (Campes and Tilghman (1989) Genes
Dev. 3:537-546).

The invention further provides a recombinant expression vector comprising a DNA
molecule of the invention cloned into the expression vector in an antisense orientation.
Regulatory sequences (e.g., viral promoters and/or enhancers) operatively linked to a nucleic
acid cloned in the antisense orientation can be chosen which direct the constitutive, tissue
specific or cell type specific expression of antisense RNA in a variety of cell types. The
antisense expression vector can be in the form of a recombinant plasmid, phagemid or
attenuated virus.

Another aspect of the invention provides a host cell which includes a nucleic acid
molecule described herein, e.g., a 31939 nucleic acid molecule within a recombinant
expression vector or a 31939 nucleic acid molecule containing sequences which allow it to
homologously recombine into a specific site of the host cell's genome. The terms "host cell"
and "recombinant host cell" are used interchangeably herein. Such terms refer not only to
the particular subject cell but to the progeny or potential progeny of such a cell. Because
certain modifications may occur in succeeding generations due to either mutation or
environmental influences, such progeny may not, in fact, be identical to the parent cell, but
are still included within the scope of the term as used herein.

A host cell can be any prokaryotic or eukaryotic cell. For example, a 31939 protein
can be expressed in bacterial cells (such as E. coli), insect cells, yeast or mammalian cells
(such as Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are
known to those skilled in the art.

Vector DNA can be introduced into host cells via conventional transformation or
transfection techniques. As used herein, the terms "transformation" and "transfection" are
intended to refer to a variety of art-recognized techniques for introducing foreign nucleic
acid (e.g., DNA) into a host cell, including calcium phosphate or calcium chloride
co-precipitation, DEAE-dextran-mediated transfection, lipofection, or electroporation.

A host cell of the invention can be used to produce (i.e., express) a 31939 protein.
Accordingly, the invention further provides methods for producing a 31939 protein using the
host cells of the invention. In one embodiment, the method includes culturing the host cell
of the invention (into which a recombinant expression vector encoding a 31939 protein has
been introduced) in a suitable medium such that a 31939 protein is produced. In another
embodiment, the method further includes isolating a 31939 protein from the medium or the
host cell.

In another aspect, the invention features a cell or purified preparation of cells which
include a 31939 transgene, or which otherwise misexpress 31939. The cell preparation can
consist of human or non-human cells, e.g., rodent cells, e.g., mouse or rat cells, rabbit cells,
or pig cells. In preferred embodiments, the cell or cells include a 31939 transgene, e.g., a
heterologous form of a 31939, e.g., a gene derived from humans (in the case of a non-human
cell). The 31939 transgene can be misexpressed, e.g., overexpressed or underexpressed. In
other preferred embodiments, the cell or cells include a gene that mis-expresses an
endogenous 31939, e.g., a gene the expression of which is disrupted, e.g., a knockout. Such
cells can serve as a model for studying disorders that are related to mutated or mis-expressed
31939 alleles or for use in drug screening.

In another aspect, the invention features a human cell, e.g., a hematopoietic stem
cell or a fibroblast, transformed with nucleic acid which encodes a subject 31939
polypeptide.

Also provided are cells, preferably human cells, e.g., human hematopoietic or
fibroblast cells, in which an endogenous 31939 is under the control of a regulatory sequence
that does not normally control the expression of the endogenous 31939 gene. The
expression characteristics of an endogenous gene within a cell, e.g., a cell line or
microorganism, can be modified by inserting a heterologous DNA regulatory element into
the genome of the cell such that the inserted regulatory element is operably linked to the
endogenous 31939 gene. For example, an endogenous 31939 gene which is
"transcriptionally silent," e.g., not normally expressed, or expressed only at very low levels,
may be activated by inserting a regulatory element which is capable of promoting the
expression of a normally expressed gene product in that cell. Techniques such as targeted
homologous recombination can be used to insert the heterologous DNA as described in,
e.g., Chappel, US 5,272,071; WO 91/06667, published on May 16, 1991.

In a preferred embodiment, recombinant cells described herein can be used for
replacement therapy in a subject. For example, a nucleic acid encoding a 31939 polypeptide
operably linked to an inducible promoter (e.g., a steroid hormone receptor-regulated
promoter) is introduced into a human or nonhuman, e.g., mammalian, e.g., porcine
recombinant cell. The cell is cultivated and encapsulated in a biocompatible material, such
as poly-lysine alginate, and subsequently implanted into the subject. See, e.g., Lanza (1996)
Nat. Biotechnol. 14:1107; Joki et al. (2001) Nat. Biotechnol. 19:35; and U.S. Patent No.
5,876,742. Production of 31939 polypeptide can be regulated in the subject by
administering an agent (e.g., a steroid hormone) to the subject. In one embodiment, the
expressed 31939 polypeptide is an extracellular fragment of the 31939 polypeptide, e.g.,
about amino acids 38 to 576 or about 56 to 362 of SEQ ID NO:2. In another preferred
embodiment, the implanted recombinant cells express and secrete an antibody specific for a
31939 polypeptide. The antibody can be any antibody or any antibody derivative described
herein.
The invention provides non-human transgenic animals. Such animals are useful for
studying the function and/or activity of a 31939 protein and for identifying and/or evaluating
modulators of 31939 activity. As used herein, a "transgenic animal" is a non-human animal,
preferably a mammal, more preferably a rodent such as a rat or mouse, in which one or more
of the cells of the animal includes a transgene. Other examples of transgenic animals
include non-human primates, sheep, dogs, cows, goats, chickens, amphibians, and the like.
A transgene is exogenous DNA or a rearrangement, e.g., a deletion of endogenous
chromosomal DNA, which preferably is integrated into or occurs in the genome of the cells
of a transgenic animal. A transgene can direct the expression of an encoded gene product in
one or more cell types or tissues of the transgenic animal; other transgenes, e.g., a knockout,
reduce expression. Thus, a transgenic animal can be one in which an endogenous 31939
gene has been altered by, e.g., homologous recombination between the endogenous gene
and an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic
cell of the animal, prior to development of the animal.

Intronic sequences and polyadenylation signals can also be included in the transgene
to increase the efficiency of expression of the transgene. A tissue-specific regulatory
sequence(s) can be operably linked to a transgene of the invention to direct expression of a
31939 protein to particular cells. A transgenic founder animal can be identified based upon
the presence of a 31939 transgene in its genome and/or expression of 31939 mRNA in
tissues or cells of the animals. A transgenic founder animal can then be used to breed
additional animals carrying the transgene. Moreover, transgenic animals carrying a
transgene encoding a 31939 protein can further be bred to other transgenic animals carrying
other transgenes.

31939 proteins or polypeptides can be expressed in transgenic animals or plants, e.g.,
a nucleic acid encoding the protein or polypeptide can be introduced into the genome of an
animal. In preferred embodiments the nucleic acid is placed under the control of a tissue
specific promoter, e.g., a milk or egg specific promoter, and the protein is recovered from the
milk or eggs produced by the animal. Suitable animals are mice, pigs, cows, goats, and sheep.

The invention also includes a population of cells from a transgenic animal, as
discussed, e.g., below.
Uses

The nucleic acid molecules, proteins, protein homologues, and antibodies described
herein can be used in one or more of the following methods: a) screening assays; b)
predictive medicine (e.g., diagnostic assays, prognostic assays, monitoring clinical trials,
and pharmacogenetics); and c) methods of treatment (e.g., therapeutic and prophylactic).

The isolated nucleic acid molecules of the invention can be used, for example, to
express a 31939 protein (e.g., via a recombinant expression vector in a host cell in gene
therapy applications), to detect a 31939 mRNA (e.g., in a biological sample) or a genetic
alteration in a 31939 gene, and to modulate 31939 activity, as described further below. The
31939 proteins can be used to treat disorders characterized by insufficient or excessive
production of a 31939 substrate or production of 31939 inhibitors. In addition, the 31939
proteins can be used to screen for naturally occurring 31939 substrates, to screen for drugs
or compounds which modulate 31939 activity, as well as to treat disorders characterized by
insufficient or excessive production of 31939 protein or production of 31939 protein forms
which have decreased, aberrant or unwanted activity compared to 31939 wild type protein
(e.g., a cell proliferative or differentiative disorder, e.g., cancer, or a neuronal disorder).
Moreover, the anti-31939 antibodies of the invention can be used to detect and isolate 31939
proteins, regulate the bioavailability of 31939 proteins, and modulate 31939 activity.

A method of evaluating a compound for the ability to interact with, e.g., bind, a
subject 31939 polypeptide is provided. The method includes: contacting the compound
with the subject 31939 polypeptide; and evaluating the ability of the compound to interact with,
e.g., to bind or form a complex with, the subject 31939 polypeptide. This method can be
performed in vitro, e.g., in a cell free system, or in vivo, e.g., in a two-hybrid interaction trap
assay. This method can be used to identify naturally occurring molecules that interact with
the subject 31939 polypeptide. It can also be used to find natural or synthetic inhibitors of
the subject 31939 polypeptide. Screening methods are discussed in more detail below.

Screening Assays 

The invention provides methods (also referred to herein as "screening assays") for
identifying modulators, i.e., candidate or test compounds or agents (e.g., proteins, peptides,
peptidomimetics, peptoids, small molecules or other drugs) which bind to 31939 proteins,
have a stimulatory or inhibitory effect on, for example, 31939 expression or 31939 activity,
or have a stimulatory or inhibitory effect on, for example, the expression or activity of a
31939 substrate. Compounds thus identified can be used to modulate the activity of target
gene products (e.g., 31939 genes) in a therapeutic protocol, to elaborate the biological
function of the target gene product, or to identify compounds that disrupt normal target gene
interactions.

In one embodiment, the invention provides assays for screening candidate or test
compounds which are substrates of a 31939 protein or polypeptide or a biologically active
portion thereof. In another embodiment, the invention provides assays for screening
candidate or test compounds that bind to or modulate an activity of a 31939 protein or
polypeptide or a biologically active portion thereof.

In one embodiment, an activity of a 31939 protein can be assayed as follows. Cells
transformed with a nucleic acid which expresses a 31939 protein are contacted with an
extracellular ligand or with a stimulating cell. The cell adhesive properties and cell
proliferative properties of the transformed cell (e.g., as indicated by a transformation marker
such as green fluorescent protein) are monitored as is routine in the art.

The test compounds of the present invention can be obtained using any of the
numerous approaches in combinatorial library methods known in the art, including:
biological libraries; peptoid libraries (libraries of molecules having the functionalities of
peptides, but with a novel, non-peptide backbone which are resistant to enzymatic
degradation but which nevertheless remain bioactive; see, e.g., Zuckermann, R.N. et al.
(1994) J. Med. Chem. 37:2678-85); spatially addressable parallel solid phase or solution
phase libraries; synthetic library methods requiring deconvolution; the 'one-bead
one-compound' library method; and synthetic library methods using affinity chromatography
selection. The biological library and peptoid library approaches are limited to peptide
libraries, while the other four approaches are applicable to peptide, non-peptide oligomer or
small molecule libraries of compounds (Lam (1997) Anticancer Drug Des. 12:145).

Examples of methods for the synthesis of molecular libraries can be found in the art,
for example in: DeWitt et al. (1993) Proc. Natl. Acad. Sci. U.S.A. 90:6909; Erb et al. (1994)
Proc. Natl. Acad. Sci. USA 91:11422; Zuckermann et al. (1994) J. Med. Chem. 37:2678;
Cho et al. (1993) Science 261:1303; Carrell et al. (1994) Angew. Chem. Int. Ed. Engl.
33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and Gallop et al. (1994)
J. Med. Chem. 37:1233.

Libraries of compounds may be presented in solution (e.g., Houghten (1992)
Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor
(1993) Nature 364:555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), spores (Ladner
U.S. Patent No. 5,223,409), plasmids (Cull et al. (1992) Proc. Natl. Acad. Sci. USA
89:1865-1869) or on phage (Scott and Smith (1990) Science 249:386-390; Devlin (1990) Science
249:404-406; Cwirla et al. (1990) Proc. Natl. Acad. Sci. 87:6378-6382; Felici (1991) J.
Mol. Biol. 222:301-310; Ladner supra.).

In one embodiment, an assay is a cell-based assay in which a cell which expresses a
31939 protein or biologically active portion thereof is contacted with a test compound, and
the ability of the test compound to modulate 31939 activity is determined. Determining the
ability of the test compound to modulate 31939 activity can be accomplished by monitoring,
for example, extracellular ligand binding, or cell signaling. The cell, for example, can be of
mammalian origin, e.g., human.

The ability of the test compound to modulate 31939 binding to a compound, e.g., a
31939 substrate, or to bind to 31939 can also be evaluated. This can be accomplished, for
example, by coupling the compound, e.g., the substrate, with a radioisotope or enzymatic
label such that binding of the compound, e.g., the substrate, to 31939 can be determined by
detecting the labeled compound, e.g., substrate, in a complex. Alternatively, 31939 could be
coupled with a radioisotope or enzymatic label to monitor the ability of a test compound to
modulate 31939 binding to a 31939 substrate in a complex. For example, compounds (e.g.,
31939 substrates) can be labeled with 125I, 35S, 14C, or 3H, either directly or indirectly,
and the radioisotope detected by direct counting of radioemission or by scintillation
counting. Alternatively, compounds can be enzymatically labeled with, for example,
horseradish peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label detected
by determination of conversion of an appropriate substrate to product.

The ability of a compound (e.g., a 31939 substrate) to interact with 31939 with or
without the labeling of any of the interactants can be evaluated. For example, a
microphysiometer can be used to detect the interaction of a compound with 31939 without
the labeling of either the compound or the 31939. McConnell, H. M. et al. (1992) Science
257:1906-1912. As used herein, a "microphysiometer" (e.g., Cytosensor) is an analytical
instrument that measures the rate at which a cell acidifies its environment using a
light-addressable potentiometric sensor (LAPS). Changes in this acidification rate can be used as
an indicator of the interaction between a compound and 31939.
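
As a rough sketch of how a change in acidification rate might be quantified, the rate can be estimated as the least-squares slope of pH against time and compared before and after a compound is added. The function names and the readings in the usage note are hypothetical; an actual instrument reports its own processed rates.

```python
from statistics import mean


def acidification_rate(times_s, ph_values):
    """Least-squares slope of pH against time, sign-flipped so that faster
    acidification gives a larger positive number (pH units per second)."""
    if len(times_s) != len(ph_values) or len(times_s) < 2:
        raise ValueError("need at least two paired readings")
    t_bar, p_bar = mean(times_s), mean(ph_values)
    num = sum((t - t_bar) * (p - p_bar) for t, p in zip(times_s, ph_values))
    den = sum((t - t_bar) ** 2 for t in times_s)
    if den == 0:
        raise ValueError("time points must not all be identical")
    return -num / den


def fold_change(baseline_rate, treated_rate):
    """Ratio of the rate after compound addition to the baseline rate."""
    if baseline_rate == 0:
        raise ValueError("baseline rate is zero")
    return treated_rate / baseline_rate
```

For example, with hypothetical readings taken every 10 seconds, a baseline trace of pH 7.40, 7.39, 7.38, 7.37 and a treated trace of pH 7.40, 7.38, 7.36, 7.34 give a fold change of about 2, which would be read as the compound roughly doubling the acidification rate.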

In yet another embodiment, a cell-free assay is provided in which a 31939 protein or
biologically active portion thereof is contacted with a test compound and the ability of the
test compound to bind to the 31939 protein or biologically active portion thereof is
evaluated. Preferred biologically active portions of the 31939 proteins to be used in assays
of the present invention include fragments which participate in interactions with non-31939
molecules, e.g., fragments with high surface probability scores.

Soluble and/or membrane-bound forms of isolated proteins (e.g., 31939 proteins or
biologically active portions thereof) can be used in the cell-free assays of the invention.
When membrane-bound forms of the protein are used, it may be desirable to utilize a
solubilizing agent. Examples of such solubilizing agents include non-ionic detergents such
as n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide,
decanoyl-N-methylglucamide, Triton® X-100, Triton® X-114, Thesit®,
Isotridecypoly(ethylene glycol ether)n, 3-[(3-cholamidopropyl)dimethylamminio]-1-propane
sulfonate (CHAPS), 3-[(3-cholamidopropyl)dimethylamminio]-2-hydroxy-1-propane
sulfonate (CHAPSO), or N-dodecyl-N,N-dimethyl-3-ammonio-1-propane sulfonate.

Cell-free assays involve preparing a reaction mixture of the target gene protein and
the test compound under conditions and for a time sufficient to allow the two components to
interact and bind, thus forming a complex that can be removed and/or detected.

The interaction between two molecules can also be detected, e.g., using fluorescence
energy transfer (FET) (see, for example, Lakowicz et al., U.S. Patent No. 5,631,169;
Stavrianopoulos, et al., U.S. Patent No. 4,868,103). A fluorophore label on the first, 'donor'
molecule is selected such that its emitted fluorescent energy will be absorbed by a
fluorescent label on a second, 'acceptor' molecule, which in turn is able to fluoresce due to
the absorbed energy. Alternately, the 'donor' protein molecule may simply utilize the
natural fluorescent energy of tryptophan residues. Labels are chosen that emit different
wavelengths of light, such that the 'acceptor' molecule label may be differentiated from that
of the 'donor'. Since the efficiency of energy transfer between the labels is related to the
distance separating the molecules, the spatial relationship between the molecules can be
assessed. In a situation in which binding occurs between the molecules, the fluorescent
emission of the 'acceptor' molecule label in the assay should be maximal. An FET binding
event can be conveniently measured through standard fluorometric detection means well
known in the art (e.g., using a fluorimeter).
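
The distance dependence of energy transfer is commonly modeled with the Förster relation, E = 1 / (1 + (r/R0)^6), where r is the donor-acceptor separation and R0 is the separation at which transfer is 50% efficient. The specification does not state this formula; the sketch below assumes it, and the function name and the example distances are hypothetical.

```python
def fret_efficiency(r_nm: float, r0_nm: float) -> float:
    """Förster transfer efficiency for a donor-acceptor separation r_nm,
    given the Förster radius r0_nm of the dye pair (same length units)."""
    if r_nm < 0 or r0_nm <= 0:
        raise ValueError("r_nm must be >= 0 and r0_nm must be > 0")
    return 1.0 / (1.0 + (r_nm / r0_nm) ** 6)


if __name__ == "__main__":
    # Hypothetical dye pair with a 5 nm Förster radius.
    for r in (2.5, 5.0, 10.0):
        print(r, round(fret_efficiency(r, 5.0), 4))
```

Because of the sixth-power term, efficiency falls steeply once the labels are separated by more than R0, which is why acceptor emission is expected to be highest when the two labeled molecules are bound to each other.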

In another embodiment, determining the ability of the 31939 protein to bind to a
target molecule can be accomplished using real-time Biomolecular Interaction Analysis
(BIA) (see, e.g., Sjolander, S. and Urbaniczky, C. (1991) Anal. Chem. 63:2338-2345 and
Szabo et al. (1995) Curr. Opin. Struct. Biol. 5:699-705). "Surface plasmon resonance" or
"BIA" detects biospecific interactions in real time, without labeling any of the interactants
(e.g., BIAcore). Changes in the mass at the binding surface (indicative of a binding event)
result in alterations of the refractive index of light near the surface (the optical phenomenon
of surface plasmon resonance (SPR)), resulting in a detectable signal which can be used as
an indication of real-time reactions between biological molecules.

In one embodiment, the target gene product or the test substance is anchored onto a
solid phase. The target gene product/test compound complexes anchored on the solid phase
can be detected at the end of the reaction. Preferably, the target gene product can be
anchored onto a solid surface, and the test compound (which is not anchored) can be
labeled, either directly or indirectly, with detectable labels discussed herein.

It may be desirable to immobilize either 31939, an anti-31939 antibody or its target
molecule to facilitate separation of complexed from uncomplexed forms of one or both of
the proteins, as well as to accommodate automation of the assay. Binding of a test
compound to a 31939 protein, or interaction of a 31939 protein with a target molecule in the
presence and absence of a candidate compound, can be accomplished in any vessel suitable
for containing the reactants. Examples of such vessels include microtiter plates, test tubes,
and micro-centrifuge tubes. In one embodiment, a fusion protein can be provided which
adds a domain that allows one or both of the proteins to be bound to a matrix. For example,
glutathione-S-transferase/31939 fusion proteins or glutathione-S-transferase/target fusion
proteins can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO)
or glutathione derivatized microtiter plates, which are then combined with the test
compound or the test compound and either the non-adsorbed target protein or 31939 protein,
and the mixture incubated under conditions conducive to complex formation (e.g., at
physiological conditions for salt and pH). Following incubation, the beads or microtiter
plate wells are washed to remove any unbound components, the matrix immobilized in the
case of beads, complex determined either directly or indirectly, for example, as described
above. Alternatively, the complexes can be dissociated from the matrix, and the level of
31939 binding or activity determined using standard techniques.

Other techniques for immobilizing either a 31939 protein or a target molecule on
matrices include using conjugation of biotin and streptavidin. Biotinylated 31939 protein or
target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using
techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), and
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical).

In order to conduct the assay, the non-immobilized component is added to the coated
surface containing the anchored component. After the reaction is complete, unreacted
components are removed (e.g., by washing) under conditions such that any complexes
formed will remain immobilized on the solid surface. The detection of complexes anchored
on the solid surface can be accomplished in a number of ways. Where the previously
non-immobilized component is pre-labeled, the detection of label immobilized on the surface
indicates that complexes were formed. Where the previously non-immobilized component
is not pre-labeled, an indirect label can be used to detect complexes anchored on the surface;
e.g., using a labeled antibody specific for the immobilized component (the antibody, in turn,
can be directly labeled or indirectly labeled with, e.g., a labeled anti-Ig antibody).

In one embodiment, this assay is performed utilizing antibodies reactive with 31939
protein or target molecules but which do not interfere with binding of the 31939 protein to
its target molecule. Such antibodies can be derivatized to the wells of the plate, and
unbound target or 31939 protein trapped in the wells by antibody conjugation. Methods for
detecting such complexes, in addition to those described above for the GST-immobilized
complexes, include immunodetection of complexes using antibodies reactive with the 31939
protein or target molecule, as well as enzyme-linked assays which rely on detecting an
enzymatic activity associated with the 31939 protein or target molecule.

Alternatively, cell free assays can be conducted in a liquid phase. In such an assay,
the reaction products are separated from unreacted components by any of a number of
standard techniques, including but not limited to: differential centrifugation (see, for
example, Rivas, G., and Minton, A.P., (1993) Trends Biochem Sci 18:284-7);
chromatography (gel filtration chromatography, ion-exchange chromatography);
electrophoresis (see, e.g., Ausubel, F. et al., eds. Current Protocols in Molecular Biology
1999, J. Wiley: New York); and immunoprecipitation (see, for example, Ausubel, F. et al.,
eds. (1999) Current Protocols in Molecular Biology, J. Wiley: New York). Such resins and
chromatographic techniques are known to one skilled in the art (see, e.g., Heegaard, N.H.,
(1998) J Mol Recognit 11:141-8; Hage, D.S., and Tweed, S.A. (1997) J Chromatogr B
Biomed Sci Appl. 699:499-525). Further, fluorescence energy transfer may also be
conveniently utilized, as described herein, to detect binding without further purification of
the complex from solution.

In a preferred embodiment, the assay includes contacting the 31939 protein or
biologically active portion thereof with a known compound which binds 31939 to form an
assay mixture, contacting the assay mixture with a test compound, and determining the
ability of the test compound to interact with a 31939 protein, wherein determining the ability
of the test compound to interact with a 31939 protein includes determining the ability of the
test compound to preferentially bind to 31939 or biologically active portion thereof, or to
modulate the activity of a target molecule, as compared to the known compound.

The target gene products of the invention can, in vivo, interact with one or more
cellular or extracellular macromolecules, such as proteins. For the purposes of this
discussion, such cellular and extracellular macromolecules are referred to herein as "binding
partners." Compounds that disrupt such interactions can be useful in regulating the activity
of the target gene product. Such compounds can include, but are not limited to, molecules
such as antibodies, peptides, and small molecules. The preferred target genes/products for
use in this embodiment are the 31939 genes herein identified. In an alternative embodiment,
the invention provides methods for determining the ability of the test compound to modulate
the activity of a 31939 protein through modulation of the activity of a downstream effector
of a 31939 target molecule. For example, the activity of the effector molecule on an
appropriate target can be determined, or the binding of the effector to an appropriate target
can be determined, as previously described.

To identify compounds that interfere with the interaction between the target gene
product and its cellular or extracellular binding partner(s), a reaction mixture containing the
target gene product and the binding partner is prepared, under conditions and for a time
sufficient to allow the two products to form a complex. In order to test an inhibitory agent,
the reaction mixture is provided in the presence and absence of the test compound. The test
compound can be initially included in the reaction mixture, or can be added at a time
subsequent to the addition of the target gene and its cellular or extracellular binding partner.
Control reaction mixtures are incubated without the test compound or with a placebo. The
formation of any complexes between the target gene product and the cellular or extracellular
binding partner is then detected. The formation of a complex in the control reaction, but not
in the reaction mixture containing the test compound, indicates that the compound interferes
with the interaction of the target gene product and the interactive binding partner.
Additionally, complex formation within reaction mixtures containing the test compound and
normal target gene product can also be compared to complex formation within reaction
mixtures containing the test compound and mutant target gene product. This comparison
can be important in those cases wherein it is desirable to identify compounds that disrupt
interactions of mutant but not normal target gene products.
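
The decision logic of this comparison can be illustrated with a short sketch. It is only an illustration under stated assumptions, not part of the disclosed methods: the function names, the numeric signals, and the detection threshold are hypothetical, and complex formation is assumed to be reported as one numeric readout per reaction mixture.

```python
def interferes(control_signal: float, test_signal: float, threshold: float) -> bool:
    """Return True when a complex is detected in the control reaction but not
    in the reaction mixture containing the test compound.

    Hypothetical inputs: each signal is an assumed numeric readout of complex
    formation; `threshold` is an assumed cutoff for calling a complex detected.
    """
    complex_in_control = control_signal >= threshold
    complex_in_test = test_signal >= threshold
    return complex_in_control and not complex_in_test


def mutant_selective(normal_test_signal: float, mutant_test_signal: float,
                     threshold: float) -> bool:
    """Return True when, with the test compound present, the complex still forms
    with the normal target gene product but not with the mutant one."""
    return normal_test_signal >= threshold and mutant_test_signal < threshold
```

In practice the cutoff would depend on the label and detection method used, so a fixed threshold is a simplification.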

These assays can be conducted in a heterogeneous or homogeneous format.
Heterogeneous assays involve anchoring either the target gene product or the binding
partner onto a solid phase, and detecting complexes anchored on the solid phase at the end
of the reaction. In homogeneous assays, the entire reaction is carried out in a liquid phase.
In either approach, the order of addition of reactants can be varied to obtain different
information about the compounds being tested. For example, test compounds that interfere
with the interaction between the target gene products and the binding partners, e.g., by
competition, can be identified by conducting the reaction in the presence of the test
substance. Alternatively, test compounds that disrupt preformed complexes, e.g.,
compounds with higher binding constants that displace one of the components from the
complex, can be tested by adding the test compound to the reaction mixture after complexes
have been formed. The various formats are briefly described below.

In a heterogeneous assay system, either the target gene product or the interactive
cellular or extracellular binding partner is anchored onto a solid surface (e.g., a microtiter
plate), while the non-anchored species is labeled, either directly or indirectly. The anchored
species can be immobilized by non-covalent or covalent attachments. Alternatively, an
immobilized antibody specific for the species to be anchored can be used to anchor the
species to the solid surface.

In order to conduct the assay, the partner of the immobilized species is exposed to the
coated surface with or without the test compound. After the reaction is complete, unreacted
components are removed (e.g., by washing) and any complexes formed will remain
immobilized on the solid surface. Where the non-immobilized species is pre-labeled, the
detection of label immobilized on the surface indicates that complexes were formed. Where
the non-immobilized species is not pre-labeled, an indirect label can be used to detect
complexes anchored on the surface; e.g., using a labeled antibody specific for the initially
non-immobilized species (the antibody, in turn, can be directly labeled or indirectly labeled
with, e.g., a labeled anti-Ig antibody). Depending upon the order of addition of reaction
components, test compounds that inhibit complex formation or that disrupt preformed
complexes can be detected.

Alternatively, the reaction can be conducted in a liquid phase in the presence or
absence of the test compound, the reaction products separated from unreacted components,
and complexes detected; e.g., using an immobilized antibody specific for one of the binding
components to anchor any complexes formed in solution, and a labeled antibody specific for
the other partner to detect anchored complexes. Again, depending upon the order of
addition of reactants to the liquid phase, test compounds that inhibit complex formation or
that disrupt preformed complexes can be identified.

In an alternate embodiment of the invention, a homogeneous assay can be used. For
example, a preformed complex of the target gene product and the interactive cellular or
extracellular binding partner product is prepared in that either the target gene products or
their binding partners are labeled, but the signal generated by the label is quenched due to
complex formation (see, e.g., U.S. Patent No. 4,109,496 that utilizes this approach for
immunoassays). The addition of a test substance that competes with and displaces one of
the species from the preformed complex will result in the generation of a signal above
background. In this way, test substances that disrupt target gene product-binding partner
interaction can be identified.

In yet another aspect, the 31939 proteins can be used as "bait proteins" in a two-hybrid
assay or three-hybrid assay (see, e.g., U.S. Patent No. 5,283,317; Zervos et al. (1993)
Cell 72:223-232; Madura et al. (1993) J. Biol. Chem. 268:12046-12054; Bartel et al. (1993)
Biotechniques 14:920-924; Iwabuchi et al. (1993) Oncogene 8:1693-1696; and Brent
WO94/10300), to identify other proteins, which bind to or interact with 31939 ("31939-
binding proteins" or "31939-bp") and are involved in 31939 activity. Such 31939-bps can
be activators or inhibitors of signals by the 31939 proteins or 31939 targets as, for example,
downstream elements of a 31939-mediated signaling pathway.

The two-hybrid system is based on the modular nature of most transcription factors,
which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes
two different DNA constructs. In one construct, the gene that codes for a 31939 protein is
fused to a gene encoding the DNA binding domain of a known transcription factor (e.g.,
GAL-4). In the other construct, a DNA sequence, from a library of DNA sequences, that
encodes an unidentified protein ("prey" or "sample") is fused to a gene that codes for the
activation domain of the known transcription factor. (Alternatively, the 31939 protein can
be fused to the activator domain.) If the "bait" and the "prey" proteins are able to
interact, in vivo, forming a 31939-dependent complex, the DNA-binding and activation
domains of the transcription factor are brought into close proximity. This proximity allows
transcription of a reporter gene (e.g., lacZ) which is operably linked to a transcriptional
regulatory site responsive to the transcription factor. Expression of the reporter gene can be
detected and cell colonies containing the functional transcription factor can be isolated and
used to obtain the cloned gene which encodes the protein which interacts with the 31939
protein.

In another embodiment, modulators of 31939 expression are identified. For example,
a cell or cell free mixture is contacted with a candidate compound and the expression of
31939 mRNA or protein evaluated relative to the level of expression of 31939 mRNA or
protein in the absence of the candidate compound. When expression of 31939 mRNA or
protein is greater in the presence of the candidate compound than in its absence, the
candidate compound is identified as a stimulator of 31939 mRNA or protein expression.
Alternatively, when expression of 31939 mRNA or protein is less (statistically significantly
less) in the presence of the candidate compound than in its absence, the candidate compound
is identified as an inhibitor of 31939 mRNA or protein expression. The level of 31939
mRNA or protein expression can be determined by methods described herein for detecting
31939 mRNA or protein.
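
As a rough illustration of this classification, the sketch below labels a candidate compound from mean expression levels measured with and without it. The function name, the inputs, and the 0.05 significance cutoff are hypothetical. The sketch also assumes that a p-value from some statistical test is supplied by the caller and that significance is required in both directions, whereas the text states the significance requirement explicitly only for the inhibitor case.

```python
def classify_modulator(level_with: float, level_without: float,
                       p_value: float, alpha: float = 0.05) -> str:
    """Label a candidate compound from 31939 expression levels.

    Hypothetical inputs: `level_with` and `level_without` are mean expression
    levels in the presence and absence of the compound; `p_value` comes from
    a statistical test chosen by the user; `alpha` is an assumed cutoff.
    """
    if p_value >= alpha:
        return "no significant effect"
    if level_with > level_without:
        return "stimulator"
    if level_with < level_without:
        return "inhibitor"
    return "no significant effect"
```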

In another aspect, the invention pertains to a combination of two or more of the
assays described herein. For example, a modulating agent can be identified using a cell-
based or a cell free assay, and the ability of the agent to modulate the activity of a 31939
protein can be confirmed in vivo, e.g., in an animal such as an animal model for a cell
proliferative or differentiative disorder, e.g., cancer, or a neuronal disorder.

This invention further pertains to novel agents identified by the above-described
screening assays. Accordingly, it is within the scope of this invention to further use an agent
identified as described herein (e.g., a 31939 modulating agent, an antisense 31939 nucleic
acid molecule, a 31939-specific antibody, or a 31939-binding partner) in an appropriate
animal model to determine the efficacy, toxicity, side effects, or mechanism of action, of
treatment with such an agent. Furthermore, novel agents identified by the above-described
screening assays can be used for treatments as described herein.






Detection Assays 

Portions or fragments of the nucleic acid sequence identified herein can be used as
polynucleotide reagents. For example, these sequences can be used to: (i) map their
respective genes on a chromosome, e.g., to locate gene regions associated with genetic
disease or to associate 31939 with a disease; (ii) identify an individual from a minute
biological sample (tissue typing); and (iii) aid in forensic identification of a biological
sample. These applications are described in the subsections below.

Chromosome Mapping 

The 31939 nucleotide sequences or portions thereof can be used to map the location
of the 31939 genes on a chromosome. This process is called chromosome mapping.
Chromosome mapping is useful in correlating the 31939 sequences with genes associated
with disease.

Briefly, 31939 genes can be mapped to chromosomes by preparing PCR primers
(preferably 15-25 bp in length) from the 31939 nucleotide sequences. These primers can
then be used for PCR screening of somatic cell hybrids containing individual human
chromosomes. Only those hybrids containing the human gene corresponding to the 31939
sequences will yield an amplified fragment.

A panel of somatic cell hybrids in which each cell line contains either a single human
chromosome or a small number of human chromosomes, and a full set of mouse
chromosomes, can allow easy mapping of individual genes to specific human chromosomes.
(D'Eustachio P. et al. (1983) Science 220:919-924).

Other mapping strategies, e.g., in situ hybridization (described in Fan, Y. et al. (1990)
Proc. Natl. Acad. Sci. USA, 87:6223-27), pre-screening with labeled flow-sorted
chromosomes, and pre-selection by hybridization to chromosome specific cDNA libraries
can be used to map 31939 to a chromosomal location.

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase
chromosomal spread can further be used to provide a precise chromosomal location in one
step. The FISH technique can be used with a DNA sequence as short as 500 or 600 bases.
However, clones larger than 1,000 bases have a higher likelihood of binding to a unique
chromosomal location with sufficient signal intensity for simple detection. Preferably 1,000
bases, and more preferably 2,000 bases will suffice to get good results in a reasonable
amount of time. For a review of this technique, see Verma et al., Human Chromosomes: A
Manual of Basic Techniques ((1988) Pergamon Press, New York).

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for 
5 marking multiple sites and/or multiple chromosomes. Reagents corresponding to noncoding 
regions of the genes actually are preferred for mapping purposes. Coding sequences are 
more likely to be conserved within gene families, thus increasing the chance of cross
hybridizations during chromosomal mapping.

Once a sequence has been mapped to a precise chromosomal location, the physical
position of the sequence on the chromosome can be correlated with genetic map data. (Such
data are found, for example, in V. McKusick, Mendelian Inheritance in Man, available on-
line through Johns Hopkins University Welch Medical Library). The relationship between a
gene and a disease, mapped to the same chromosomal region, can then be identified through
linkage analysis (co-inheritance of physically adjacent genes), described in, for example,
Egeland, J. et al. (1987) Nature, 325:783-787.

Moreover, differences in the DNA sequences between individuals affected and
unaffected with a disease associated with the 31939 gene can be determined. If a mutation
is observed in some or all of the affected individuals but not in any unaffected individuals,
then the mutation is likely to be the causative agent of the particular disease. Comparison of
affected and unaffected individuals generally involves first looking for structural alterations
in the chromosomes, such as deletions or translocations that are visible from chromosome
spreads or detectable using PCR based on that DNA sequence. Ultimately, complete
sequencing of genes from several individuals can be performed to confirm the presence of a
mutation and to distinguish mutations from polymorphisms.


Tissue Typing 

31939 sequences can be used to identify individuals from biological samples using,
e.g., restriction fragment length polymorphism (RFLP). In this technique, an individual's
genomic DNA is digested with one or more restriction enzymes, the fragments separated,
e.g., in a Southern blot, and probed to yield bands for identification. The sequences of the
present invention are useful as additional DNA markers for RFLP (described in U.S. Patent
5,272,057).

Furthermore, the sequences of the present invention can also be used to determine the
actual base-by-base DNA sequence of selected portions of an individual's genome. Thus,
the 31939 nucleotide sequences described herein can be used to prepare two PCR primers
from the 5' and 3' ends of the sequences. These primers can then be used to amplify an
individual's DNA and subsequently sequence it. Panels of corresponding DNA sequences
from individuals, prepared in this manner, can provide unique individual identifications, as
each individual will have a unique set of such DNA sequences due to allelic differences.

Allelic variation occurs to some degree in the coding regions of these sequences, and
to a greater degree in the noncoding regions. Each of the sequences described herein can, to
some degree, be used as a standard against which DNA from an individual can be compared
for identification purposes. Because greater numbers of polymorphisms occur in the
noncoding regions, fewer sequences are necessary to differentiate individuals. The
noncoding sequences of SEQ ID NO:1 can provide positive individual identification with a
panel of perhaps 10 to 1,000 primers which each yield a noncoding amplified sequence of
100 bases. If predicted coding sequences, such as those in SEQ ID NO:3, are used, a more
appropriate number of primers for positive individual identification would be 500-2,000.

If a panel of reagents from 31939 nucleotide sequences described herein is used to
generate a unique identification database for an individual, those same reagents can later be
used to identify tissue from that individual. Using the unique identification database,
positive identification of the individual, living or dead, can be made from extremely small
tissue samples.

Use of Partial 31939 Sequences in Forensic Biology 

DNA-based identification techniques can also be used in forensic biology. To make
such an identification, PCR technology can be used to amplify DNA sequences taken from
very small biological samples such as tissues, e.g., hair or skin, or body fluids, e.g., blood,
saliva, or semen found at a crime scene. The amplified sequence can then be compared to a
standard, thereby allowing identification of the origin of the biological sample.

The sequences of the present invention can be used to provide polynucleotide
reagents, e.g., PCR primers, targeted to specific loci in the human genome, which can
enhance the reliability of DNA-based forensic identifications by, for example, providing
another "identification marker" (i.e. another DNA sequence that is unique to a particular
individual). As mentioned above, actual base sequence information can be used for
identification as an accurate alternative to patterns formed by restriction enzyme generated
fragments. Sequences targeted to noncoding regions of SEQ ID NO:1 (e.g., fragments
derived from the noncoding regions of SEQ ID NO:1 having a length of at least 20 bases,
preferably at least 30 bases) are particularly appropriate for this use.

The 31939 nucleotide sequences described herein can further be used to provide
polynucleotide reagents, e.g., labeled or labelable probes which can be used in, for example,
an in situ hybridization technique, to identify a specific tissue. This can be very useful in
cases where a forensic pathologist is presented with a tissue of unknown origin. Panels of
such 31939 probes can be used to identify tissue by species and/or by organ type.

In a similar fashion, these reagents, e.g., 31939 primers or probes, can be used to
screen tissue culture for contamination (i.e. screen for the presence of a mixture of different
types of cells in a culture).

Predictive Medicine 

The present invention also pertains to the field of predictive medicine in which
diagnostic assays, prognostic assays, and monitoring clinical trials are used for prognostic
(predictive) purposes to thereby treat an individual.

Generally, the invention provides a method of determining if a subject is at risk for a
disorder related to a lesion in or the misexpression of a gene which encodes 31939.

Such disorders include, e.g., a disorder associated with the misexpression of the 31939
gene; a disorder of the central or peripheral nervous system.

The method includes one or more of the following:

detecting, in a tissue of the subject, the presence or absence of a mutation which
affects the expression of the 31939 gene, or detecting the presence or absence of a mutation
in a region which controls the expression of the gene, e.g., a mutation in the 5' control
region;

detecting, in a tissue of the subject, the presence or absence of a mutation which
alters the structure of the 31939 gene;

detecting, in a tissue of the subject, the misexpression of the 31939 gene, at the
mRNA level, e.g., detecting a non-wild type level of a mRNA;

detecting, in a tissue of the subject, the misexpression of the gene, at the protein
level, e.g., detecting a non-wild type level of a 31939 polypeptide.

In preferred embodiments the method includes: ascertaining the existence of at least
one of: a deletion of one or more nucleotides from the 31939 gene; an insertion of one or
more nucleotides into the gene, a point mutation, e.g., a substitution of one or more
nucleotides of the gene, a gross chromosomal rearrangement of the gene, e.g., a
translocation, inversion, or deletion.

For example, detecting the genetic lesion can include: (i) providing a probe/primer
including an oligonucleotide containing a region of nucleotide sequence which hybridizes to
a sense or antisense sequence from SEQ ID NO:1, or naturally occurring mutants thereof or
5' or 3' flanking sequences naturally associated with the 31939 gene; (ii) exposing the
probe/primer to nucleic acid of the tissue; and detecting, by hybridization, e.g., in situ
hybridization, of the probe/primer to the nucleic acid, the presence or absence of the genetic
lesion.

In preferred embodiments detecting the misexpression includes ascertaining the
existence of at least one of: an alteration in the level of a messenger RNA transcript of the
31939 gene; the presence of a non-wild type splicing pattern of a messenger RNA transcript
of the gene; or a non-wild type level of 31939.

Methods of the invention can be used prenatally or to determine if a subject's 
offspring will be at risk for a disorder. 

In preferred embodiments the method includes determining the structure of a 31939
gene, an abnormal structure being indicative of risk for the disorder.

In preferred embodiments the method includes contacting a sample from the subject
with an antibody to the 31939 protein or a nucleic acid, which hybridizes specifically with
the gene. These and other embodiments are discussed below.

Diagnostic and Prognostic Assays

Diagnostic and prognostic assays of the invention include methods for assessing the
expression level of 31939 molecules and for identifying variations and mutations in the
sequence of 31939 molecules.

Expression Monitoring and Profiling. The presence, level, or absence of 31939
protein or nucleic acid in a biological sample can be evaluated by obtaining a biological
sample from a test subject and contacting the biological sample with a compound or an
agent capable of detecting 31939 protein or nucleic acid (e.g., mRNA, genomic DNA) that
encodes 31939 protein such that the presence of 31939 protein or nucleic acid is detected in
the biological sample. The term "biological sample" includes tissues, cells and biological
fluids isolated from a subject, as well as tissues, cells and fluids present within a subject. A
preferred biological sample is serum. The level of expression of the 31939 gene can be
measured in a number of ways, including, but not limited to: measuring the mRNA encoded
by the 31939 genes; measuring the amount of protein encoded by the 31939 genes; or
measuring the activity of the protein encoded by the 31939 genes.

The level of mRNA corresponding to the 31939 gene in a cell can be determined
both by in situ and by in vitro formats.

The isolated mRNA can be used in hybridization or amplification assays that include,
but are not limited to, Southern or Northern analyses, polymerase chain reaction analyses
and probe arrays. One preferred diagnostic method for the detection of mRNA levels
involves contacting the isolated mRNA with a nucleic acid molecule (probe) that can
hybridize to the mRNA encoded by the gene being detected. The nucleic acid probe can be,
for example, a full-length 31939 nucleic acid, such as the nucleic acid of SEQ ID NO:1, or a
portion thereof, such as an oligonucleotide of at least 7, 15, 30, 50, 100, 250 or 500
nucleotides in length and sufficient to specifically hybridize under stringent conditions to
31939 mRNA or genomic DNA. The probe can be disposed on an address of an array, e.g.,
an array described below. Other suitable probes for use in the diagnostic assays are
described herein.

In one format, mRNA (or cDNA) is immobilized on a surface and contacted with the
probes, for example by running the isolated mRNA on an agarose gel and transferring the
mRNA from the gel to a membrane, such as nitrocellulose. In an alternative format, the
probes are immobilized on a surface and the mRNA (or cDNA) is contacted with the probes,
for example, in a two-dimensional gene chip array described below. A skilled artisan can
adapt known mRNA detection methods for use in detecting the level of mRNA encoded by
the 31939 genes.

The level of mRNA in a sample that is encoded by 31939 can be evaluated
with nucleic acid amplification, e.g., by rtPCR (Mullis (1987) U.S. Patent No. 4,683,202),
ligase chain reaction (Barany (1991) Proc. Natl. Acad. Sci. USA 88:189-193), self sustained
sequence replication (Guatelli et al., (1990) Proc. Natl. Acad. Sci. USA 87:1874-1878),
transcriptional amplification system (Kwoh et al., (1989), Proc. Natl. Acad. Sci. USA
86:1173-1177), Q-Beta Replicase (Lizardi et al., (1988) Bio/Technology 6:1197), rolling
circle replication (Lizardi et al., U.S. Patent No. 5,854,033) or any other nucleic acid
amplification method, followed by the detection of the amplified molecules using techniques
known in the art. As used herein, amplification primers are defined as being a pair of
nucleic acid molecules that can anneal to 5' or 3' regions of a gene (plus and minus strands,
respectively, or vice-versa) and contain a short region in between. In general, amplification
primers are from about 10 to 30 nucleotides in length and flank a region from about 50 to
200 nucleotides in length. Under appropriate conditions and with appropriate reagents, such
primers permit the amplification of a nucleic acid molecule comprising the nucleotide
sequence flanked by the primers.
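
The general size ranges given for amplification primers can be expressed as a simple check. The sketch below is illustrative only: the function name and parameters are hypothetical, and the approximate ranges ("about 10 to 30" and "about 50 to 200" nucleotides) are treated as hard limits purely for simplicity.

```python
def primer_pair_in_general_range(forward: str, reverse: str,
                                 flanked_length: int) -> bool:
    """Check a primer pair against the general size ranges stated in the text.

    Hypothetical inputs: `forward` and `reverse` are primer sequences written
    as strings of bases; `flanked_length` is the number of nucleotides in the
    region between the two primers.
    """
    primers_ok = all(10 <= len(p) <= 30 for p in (forward, reverse))
    region_ok = 50 <= flanked_length <= 200
    return primers_ok and region_ok
```

A check of this kind says nothing about specificity or annealing behavior, which would have to be assessed separately.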

For in situ methods, a cell or tissue sample can be prepared/processed and
immobilized on a support, typically a glass slide, and then contacted with a probe that can
hybridize to mRNA that encodes the 31939 gene being analyzed.

In another embodiment, the methods further include contacting a control sample with a
compound or agent capable of detecting 31939 mRNA, or genomic DNA, and comparing
the presence of 31939 mRNA or genomic DNA in the control sample with the presence of
31939 mRNA or genomic DNA in the test sample. In still another embodiment, serial
analysis of gene expression, as described in U.S. Patent No. 5,695,937, is used to detect
31939 transcript levels.

A variety of methods can be used to determine the level of protein encoded by
31939. In general, these methods include contacting an agent that selectively binds to the
protein, such as an antibody, with a sample, to evaluate the level of protein in the sample. In
a preferred embodiment, the antibody bears a detectable label. Antibodies can be
polyclonal, or more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g.,
Fab or F(ab')2) can be used. The term "labeled", with regard to the probe or antibody, is
intended to encompass direct labeling of the probe or antibody by coupling (i.e., physically
linking) a detectable substance to the probe or antibody, as well as indirect labeling of the
probe or antibody by reactivity with a detectable substance. Examples of detectable
substances are provided herein.

The detection methods can be used to detect 31939 protein in a biological sample in
vitro as well as in vivo. In vitro techniques for detection of 31939 protein include enzyme
linked immunosorbent assays (ELISAs), immunoprecipitations, immunofluorescence,
enzyme immunoassay (EIA), radioimmunoassay (RIA), and Western blot analysis. In vivo
techniques for detection of 31939 protein include introducing into a subject a labeled anti-
31939 antibody. For example, the antibody can be labeled with a radioactive marker whose
presence and location in a subject can be detected by standard imaging techniques. In
another embodiment, the sample is labeled, e.g., biotinylated, and then contacted to the
antibody, e.g., an anti-31939 antibody positioned on an antibody array (as described below).
The sample can be detected, e.g., with avidin coupled to a fluorescent label.

In another embodiment, the methods further include contacting the control sample
with a compound or agent capable of detecting 31939 protein, and comparing the presence
of 31939 protein in the control sample with the presence of 31939 protein in the test sample.

The invention also includes kits for detecting the presence of 31939 in a biological
sample. For example, the kit can include a compound or agent capable of detecting 31939
protein or mRNA in a biological sample; and a standard. The compound or agent can be
packaged in a suitable container. The kit can further comprise instructions for using the kit
to detect 31939 protein or nucleic acid.

For antibody-based kits, the kit can include: (1) a first antibody (e.g., attached to a
solid support) which binds to a polypeptide corresponding to a marker of the invention; and,
optionally, (2) a second, different antibody which binds to either the polypeptide or the first
antibody and is conjugated to a detectable agent.

For oligonucleotide-based kits, the kit can include: (1) an oligonucleotide, e.g., a
detectably labeled oligonucleotide, which hybridizes to a nucleic acid sequence encoding a
polypeptide corresponding to a marker of the invention or (2) a pair of primers useful for
amplifying a nucleic acid molecule corresponding to a marker of the invention. The kit can
also include a buffering agent, a preservative, or a protein stabilizing agent. The kit can
also include components necessary for detecting the detectable agent (e.g., an enzyme or a
substrate). The kit can also contain a control sample or a series of control samples which
can be assayed and compared to the test sample contained. Each component of the kit can
be enclosed within an individual container and all of the various containers can be within a
single package, along with instructions for interpreting the results of the assays performed
using the kit.

The diagnostic methods described herein can identify subjects having, or at risk of
developing, a disease or disorder associated with misexpressed or aberrant or unwanted
31939 expression or activity. As used herein, the term "unwanted" includes an unwanted
phenomenon involved in a biological response such as a cell proliferative or differentiative
disorder, e.g., cancer or a neuronal disorder.

In one embodiment, a disease or disorder associated with aberrant or unwanted
31939 expression or activity is identified. A test sample is obtained from a subject and
31939 protein or nucleic acid (e.g., mRNA or genomic DNA) is evaluated, wherein the
level, e.g., the presence or absence, of 31939 protein or nucleic acid is diagnostic for a
subject having or at risk of developing a disease or disorder associated with aberrant or
unwanted 31939 expression or activity. As used herein, a "test sample" refers to a
biological sample obtained from a subject of interest, including a biological fluid (e.g.,
serum), cell sample, or tissue.

The prognostic assays described herein can be used to determine whether a subject
can be administered an agent (e.g., an agonist, antagonist, peptidomimetic, protein, peptide,
nucleic acid, small molecule, or other drug candidate) to treat a disease or disorder
associated with aberrant or unwanted 31939 expression or activity. For example, such
methods can be used to determine whether a subject can be effectively treated with an agent
for a cell proliferative or differentiative disorder, e.g., cancer.

In another aspect, the invention features a computer medium having a plurality of
digitally encoded data records. Each data record includes a value representing the level of
expression of 31939 in a sample, and a descriptor of the sample. The descriptor of the
sample can be an identifier of the sample, a subject from which the sample was derived
(e.g., a patient), a diagnosis, or a treatment (e.g., a preferred treatment). In a preferred
embodiment, the data record further includes values representing the level of expression of
genes other than 31939 (e.g., other genes associated with a 31939-disorder, or other genes
on an array). The data record can be structured as a table, e.g., a table that is part of a
database such as a relational database (e.g., a SQL database of the Oracle or Sybase database
environments).
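
One way such a data record and table might look is sketched below. This is an assumption-laden illustration, not the disclosed implementation: the class, field, and table names are hypothetical, and an in-memory SQLite database from the Python standard library is substituted for the Oracle or Sybase environments named in the text.

```python
import sqlite3
from dataclasses import dataclass


@dataclass
class ExpressionRecord:
    """Hypothetical data record: a 31939 expression value plus sample descriptors."""
    sample_id: str      # identifier of the sample
    subject: str        # subject from which the sample was derived
    diagnosis: str
    treatment: str
    level_31939: float  # value representing the level of 31939 expression


def store_records(records):
    """Store records in an in-memory SQLite table (stand-in for a relational database)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE expression_records ("
        "sample_id TEXT PRIMARY KEY, subject TEXT, diagnosis TEXT, "
        "treatment TEXT, level_31939 REAL)"
    )
    conn.executemany(
        "INSERT INTO expression_records VALUES (?, ?, ?, ?, ?)",
        [(r.sample_id, r.subject, r.diagnosis, r.treatment, r.level_31939)
         for r in records],
    )
    conn.commit()
    return conn
```

Expression values for additional genes could be held in further columns or in a separate table keyed by sample identifier; which layout is preferable would depend on how many genes are tracked.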

Also featured is a method of evaluating a sample. The method includes providing a
sample, e.g., from the subject, and determining a gene expression profile of the sample,
wherein the profile includes a value representing the level of 31939 expression. The
method can further include comparing the value or the profile (i.e., multiple values) to a
reference value or reference profile. The gene expression profile of the sample can be
obtained by any of the methods described herein (e.g., by providing a nucleic acid from the
sample and contacting the nucleic acid to an array). The method can be used to diagnose a
cell proliferative or differentiative disorder, e.g., cancer, in a subject wherein an
increase or decrease in 31939 expression is an indication that the subject has or is disposed
to having a cell proliferative or differentiative disorder, e.g., cancer. The method can be
used to monitor a treatment for a cell proliferative or differentiative disorder, e.g., cancer, in
a subject. For example, the gene expression profile can be determined for a sample from a
subject undergoing treatment. The profile can be compared to a reference profile or to a
profile obtained from the subject prior to treatment or prior to onset of the disorder (see, e.g.,
Golub et al. (1999) Science 286:531).

In yet another aspect, the invention features a method of evaluating a test compound
(see also, "Screening Assays", above). The method includes providing a cell and a test
compound; contacting the test compound to the cell; obtaining a subject expression profile
for the contacted cell; and comparing the subject expression profile to one or more reference
profiles. The profiles include a value representing the level of 31939 expression. In a
preferred embodiment, the subject expression profile is compared to a target profile, e.g., a
profile for a normal cell or for a desired condition of a cell. The test compound is evaluated
favorably if the subject expression profile is more similar to the target profile than an
expression profile obtained from an uncontacted cell.

In another aspect, the invention features a method of evaluating a subject. The
method includes: a) obtaining a sample from a subject, e.g., from a caregiver, e.g., a
caregiver who obtains the sample from the subject; b) determining a subject expression
profile for the sample. Optionally, the method further includes either or both of steps: c)
comparing the subject expression profile to one or more reference expression profiles; and
d) selecting the reference profile most similar to the subject expression profile. The subject
expression profile and the reference profiles include a value representing the level of 31939
expression. A variety of routine statistical measures can be used to compare two reference
profiles. One possible metric is the length of the distance vector that is the difference
between the two profiles. Each of the subject and reference profile is represented as a
multi-dimensional vector, wherein each dimension is a value in the profile.
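
One way to read this metric is as a Euclidean distance: for a subject profile s and a
reference profile r, each with n values, the distance vector is s - r and its length is the
square root of the sum of the squared differences. The function below is a sketch under that
reading; the function name and the list representation of a profile are assumptions made
for illustration.

```python
import math


def profile_distance(subject, reference):
    """Length of the difference vector between two equal-length profiles."""
    if len(subject) != len(reference):
        raise ValueError("profiles must have the same number of values")
    # Sum the squared per-dimension differences, then take the square root.
    return math.sqrt(sum((s - r) ** 2 for s, r in zip(subject, reference)))
```

A distance of zero means the two profiles are identical in every dimension; larger values
mean less similar profiles.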

The method can further include transmitting a result to a caregiver. The result can be
the subject expression profile, a result of a comparison of the subject expression profile with
another profile, a most similar reference profile, or a descriptor of any of the
aforementioned. The result can be transmitted across a computer network, e.g., the result
can be in the form of a computer transmission, e.g., a computer data signal embedded in a
carrier wave.






Also featured is a computer medium having executable code for effecting the
following steps: receive a subject expression profile; access a database of reference
expression profiles; and either i) select a matching reference profile most similar to the
subject expression profile or ii) determine at least one comparison score for the similarity of
the subject expression profile to at least one reference profile. The subject expression
profile and the reference expression profiles each include a value representing the level of
31939 expression.
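
The steps recited above could be sketched as follows. This is an illustrative sketch, not
code from the specification: profiles are assumed to be dictionaries mapping a gene name to
an expression value, the "database" is an in-memory dictionary of named reference profiles,
and Euclidean distance over shared genes is one assumed choice of comparison score (lower
meaning more similar).

```python
import math


def comparison_score(subject, reference):
    """Euclidean distance over the genes present in both profiles."""
    for profile in (subject, reference):
        if "31939" not in profile:
            raise ValueError("each profile must include a 31939 value")
    shared = sorted(set(subject) & set(reference))
    return math.sqrt(sum((subject[g] - reference[g]) ** 2 for g in shared))


def select_matching_reference(subject, reference_db):
    """Score every reference profile and return (best_name, all_scores)."""
    if not reference_db:
        raise ValueError("reference database is empty")
    scores = {
        name: comparison_score(subject, reference)
        for name, reference in reference_db.items()
    }
    # The most similar reference is the one with the smallest distance.
    best = min(scores, key=scores.get)
    return best, scores
```

Returning every score alongside the best match covers both alternatives in the paragraph
above: option i) uses the selected name, and option ii) uses the individual scores.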

Arrays and Uses Thereof 

In another aspect, the invention features an array that includes a substrate having a
plurality of addresses. At least one address of the plurality includes a capture probe that
binds specifically to a 31939 molecule (e.g., a 31939 nucleic acid or a 31939 polypeptide).
The array can have a density of at least 10, 50, 100, 200, 500, 1,000, 2,000, or 10,000
or more addresses/cm², and ranges between. In a preferred embodiment, the plurality of
addresses includes at least 10, 100, 500, 1,000, 5,000, 10,000, or 50,000 addresses. In a
preferred embodiment, the plurality of addresses includes equal to or less than 10, 100, 500,
1,000, 5,000, 10,000, or 50,000 addresses. The substrate can be a two-dimensional substrate
such as a glass slide, a wafer (e.g., silica or plastic), a mass spectroscopy plate, or a
three-dimensional substrate such as a gel pad. Addresses in addition to the addresses of the
plurality can be disposed on the array.

In a preferred embodiment, at least one address of the plurality includes a nucleic
acid capture probe that hybridizes specifically to a 31939 nucleic acid, e.g., the sense or
anti-sense strand. In one preferred embodiment, a subset of addresses of the plurality of
addresses has a nucleic acid capture probe for 31939. Each address of the subset can include
a capture probe that hybridizes to a different region of a 31939 nucleic acid. In another
preferred embodiment, addresses of the subset include a capture probe for a 31939 nucleic
acid. Each address of the subset is unique, overlapping, and complementary to a different
variant of 31939 (e.g., an allelic variant, or all possible hypothetical variants). The array can
be used to sequence 31939 by hybridization (see, e.g., U.S. Patent No. 5,695,940).

An array can be generated by various methods, e.g., by photolithographic methods
(see, e.g., U.S. Patent Nos. 5,143,854; 5,510,270; and 5,527,681), mechanical methods (e.g.,
directed-flow methods as described in U.S. Patent No. 5,384,261), pin-based methods (e.g.,
as described in U.S. Pat. No. 5,288,514), and bead-based techniques (e.g., as described in
PCT US/93/04145).

In another preferred embodiment, at least one address of the plurality includes a
polypeptide capture probe that binds specifically to a 31939 polypeptide or fragment thereof.
The polypeptide can be a naturally-occurring interaction partner of 31939 polypeptide.
Preferably, the polypeptide is an antibody, e.g., an antibody described herein (see
"Anti-31939 Antibodies," above), such as a monoclonal antibody or a single-chain antibody.

In another aspect, the invention features a method of analyzing the expression of
31939. The method includes providing an array as described above; contacting the array
with a sample and detecting binding of a 31939-molecule (e.g., nucleic acid or polypeptide)
to the array. In a preferred embodiment, the array is a nucleic acid array. Optionally the
method further includes amplifying nucleic acid from the sample prior to or during contact
with the array.

In another embodiment, the array can be used to assay gene expression in a tissue to
ascertain tissue specificity of genes in the array, particularly the expression of 31939. If a
sufficient number of diverse samples is analyzed, clustering (e.g., hierarchical clustering,
k-means clustering, Bayesian clustering and the like) can be used to identify other genes which
are co-regulated with 31939. For example, the array can be used for the quantitation of the
expression of multiple genes. Thus, not only tissue specificity, but also the level of
expression of a battery of genes in the tissue is ascertained. Quantitative data can be used to
group (e.g., cluster) genes on the basis of their tissue expression per se and level of
expression in that tissue.
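
A much simpler stand-in for the clustering methods named above is to rank genes by how
closely their expression tracks that of 31939 across samples. The sketch below uses the
Pearson correlation coefficient for this; it is not hierarchical, k-means or Bayesian
clustering, and the function names, the 0.9 threshold and the dictionary layout (gene name
mapped to a list of per-sample levels) are assumptions made for illustration.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length lists of expression levels."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant profile carries no co-regulation signal
    return cov / math.sqrt(var_x * var_y)


def coregulated_genes(expression, target="31939", threshold=0.9):
    """Genes whose correlation with the target meets the threshold, best first."""
    target_levels = expression[target]
    hits = []
    for gene, levels in expression.items():
        if gene == target:
            continue
        r = pearson(target_levels, levels)
        if r >= threshold:
            hits.append((gene, r))
    return sorted(hits, key=lambda hit: hit[1], reverse=True)
```

Genes that pass the threshold are only candidates for co-regulation; a full clustering
analysis over many diverse samples, as the text describes, would be needed to group them.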

For example, array analysis of gene expression can be used to assess the effect of
cell-cell interactions on 31939 expression. A first tissue can be perturbed and nucleic acid
from a second tissue that interacts with the first tissue can be analyzed. In this context, the
effect of one cell type on another cell type in response to a biological stimulus can be
determined, e.g., to monitor the effect of cell-cell interaction at the level of gene expression.

In another embodiment, cells are contacted with a therapeutic agent. The expression
profile of the cells is determined using the array, and the expression profile is compared to
the profile of like cells not contacted with the agent. For example, the assay can be used to
determine or analyze the molecular basis of an undesirable effect of the therapeutic agent. If
an agent is administered therapeutically to treat one cell type but has an undesirable effect
on another cell type, the invention provides an assay to determine the molecular basis of the
undesirable effect and thus provides the opportunity to co-administer a counteracting agent
or otherwise treat the undesired effect. Similarly, even within a single cell type, undesirable
biological effects can be determined at the molecular level. Thus, the effects of an agent on
expression of other than the target gene can be ascertained and counteracted.

In another embodiment, the array can be used to monitor expression of one or more
genes in the array with respect to time. For example, samples obtained from different time
points can be probed with the array. Such analysis can identify and/or characterize the
development of a 31939-associated disease or disorder; and processes, such as a cellular
transformation associated with a 31939-associated disease or disorder. The method can also
evaluate the treatment and/or progression of a 31939-associated disease or disorder.

The array is also useful for ascertaining differential expression patterns of one or
more genes in normal and abnormal cells. This provides a battery of genes (e.g., including
31939) that could serve as a molecular target for diagnosis or therapeutic intervention.

In another aspect, the invention features an array having a plurality of addresses.
Each address of the plurality includes a unique polypeptide. At least one address of the
plurality has disposed thereon a 31939 polypeptide or fragment thereof. Methods of
producing polypeptide arrays are described in the art, e.g., in De Wildt et al. (2000) Nature
Biotech. 18, 989-994; Lueking et al. (1999) Anal. Biochem. 270, 103-111; Ge, H. (2000)
Nucleic Acids Res. 28, e3, I-VII; MacBeath, G., and Schreiber, S.L. (2000) Science 289,
1760-1763; and WO 99/51773A1. In a preferred embodiment, each address of the
plurality has disposed thereon a polypeptide at least 60, 70, 80, 85, 90, 95 or 99% identical
to a 31939 polypeptide or fragment thereof. For example, multiple variants of a 31939
polypeptide (e.g., encoded by allelic variants, site-directed mutants, random mutants, or
combinatorial mutants) can be disposed at individual addresses of the plurality. Addresses
in addition to the address of the plurality can be disposed on the array.

The polypeptide array can be used to detect a 31939 binding compound, e.g., an 
antibody in a sample from a subject with specificity for a 31939 polypeptide or the presence 
of a 31939-binding protein or ligand. 

The array is also useful for ascertaining the effect of the expression of a gene on the
expression of other genes in the same cell or in different cells (e.g., ascertaining the effect of
31939 expression on the expression of other genes). This provides, for example, for a
selection of alternate molecular targets for therapeutic intervention if the ultimate or
downstream target cannot be regulated.

In another aspect, the invention features a method of analyzing a plurality of probes.
The method is useful, e.g., for analyzing gene expression. The method includes: providing
a two dimensional array having a plurality of addresses, each address of the plurality being
positionally distinguishable from each other address of the plurality having a unique capture
probe, e.g., wherein the capture probes are from a cell or subject which expresses 31939 or
from a cell or subject in which a 31939 mediated response has been elicited, e.g., by contact
of the cell with 31939 nucleic acid or protein, or administration to the cell or subject of 31939
nucleic acid or protein; providing a two dimensional array having a plurality of addresses,
each address of the plurality being positionally distinguishable from each other address of
the plurality, and each address of the plurality having a unique capture probe, e.g., wherein
the capture probes are from a cell or subject which does not express 31939 (or does not
express as highly as in the case of the 31939 positive plurality of capture probes) or from a
cell or subject in which a 31939 mediated response has not been elicited (or has been
elicited to a lesser extent than in the first sample); contacting the array with one or more
inquiry probes (which is preferably other than a 31939 nucleic acid, polypeptide, or
antibody), and thereby evaluating the plurality of capture probes. Binding, e.g., in the case
of a nucleic acid, hybridization with a capture probe at an address of the plurality, is
detected, e.g., by signal generated from a label attached to the nucleic acid, polypeptide, or
antibody.

In another aspect, the invention features a method of analyzing a plurality of probes
or a sample. The method is useful, e.g., for analyzing gene expression. The method
includes: providing a two dimensional array having a plurality of addresses, each address of
the plurality being positionally distinguishable from each other address of the plurality
having a unique capture probe, contacting the array with a first sample from a cell or subject
which expresses or mis-expresses 31939 or from a cell or subject in which a 31939-mediated
response has been elicited, e.g., by contact of the cell with 31939 nucleic acid or protein, or
administration to the cell or subject of 31939 nucleic acid or protein; providing a two
dimensional array having a plurality of addresses, each address of the plurality being
positionally distinguishable from each other address of the plurality, and each address of the
plurality having a unique capture probe, and contacting the array with a second sample from
a cell or subject which does not express 31939 (or does not express as highly as in the case
of the 31939 positive plurality of capture probes) or from a cell or subject in which a
31939 mediated response has not been elicited (or has been elicited to a lesser extent than in
the first sample); and comparing the binding of the first sample with the binding of the
second sample. Binding, e.g., in the case of a nucleic acid, hybridization with a capture
probe at an address of the plurality, is detected, e.g., by signal generated from a label
attached to the nucleic acid, polypeptide, or antibody. The same array can be used for both
samples or different arrays can be used. If different arrays are used the plurality of
addresses with capture probes should be present on both arrays.

In another aspect, the invention features a method of analyzing 31939, e.g., analyzing
structure, function, or relatedness to other nucleic acid or amino acid sequences. The
method includes: providing a 31939 nucleic acid or amino acid sequence; comparing the
31939 sequence with one or more, preferably a plurality, of sequences from a collection of
sequences, e.g., a nucleic acid or protein sequence database; to thereby analyze 31939.

Detection of Variations or Mutations

The methods of the invention can also be used to detect genetic alterations in a 31939
gene, thereby determining if a subject with the altered gene is at risk for a disorder
characterized by misregulation in 31939 protein activity or nucleic acid expression, such as
a cell proliferative or differentiative disorder, e.g., cancer, or a neuronal disorder. In
preferred embodiments, the methods include detecting, in a sample from the subject, the
presence or absence of a genetic alteration characterized by at least one of an alteration
affecting the integrity of a gene encoding a 31939-protein, or the mis-expression of the
31939 gene. For example, such genetic alterations can be detected by ascertaining the
existence of at least one of 1) a deletion of one or more nucleotides from a 31939 gene; 2) an
addition of one or more nucleotides to a 31939 gene; 3) a substitution of one or more
nucleotides of a 31939 gene; 4) a chromosomal rearrangement of a 31939 gene; 5) an
alteration in the level of a messenger RNA transcript of a 31939 gene; 6) aberrant
modification of a 31939 gene, such as of the methylation pattern of the genomic DNA; 7)
the presence of a non-wild type splicing pattern of a messenger RNA transcript of a 31939
gene; 8) a non-wild type level of a 31939-protein; 9) allelic loss of a 31939 gene; and 10)
inappropriate post-translational modification of a 31939-protein.

An alteration can be detected without a probe/primer in a polymerase chain reaction,
such as anchor PCR or RACE PCR, or, alternatively, in a ligation chain reaction (LCR), the
latter of which can be particularly useful for detecting point mutations in the 31939-gene.
This method can include the steps of collecting a sample of cells from a subject, isolating
nucleic acid (e.g., genomic, mRNA or both) from the sample, contacting the nucleic acid
sample with one or more primers which specifically hybridize to a 31939 gene under
conditions such that hybridization and amplification of the 31939-gene (if present) occurs,
and detecting the presence or absence of an amplification product, or detecting the size of
the amplification product and comparing the length to a control sample. It is anticipated that
PCR and/or LCR may be desirable to use as a preliminary amplification step in conjunction
with any of the techniques used for detecting mutations described herein. Alternatively,
other amplification methods described herein or known in the art can be used.

In another embodiment, mutations in a 31939 gene from a sample cell can be
identified by detecting alterations in restriction enzyme cleavage patterns. For example,
sample and control DNA is isolated, amplified (optionally), digested with one or more
restriction endonucleases, and fragment length sizes are determined, e.g., by gel
electrophoresis, and compared. Differences in fragment length sizes between sample and
control DNA indicate mutations in the sample DNA. Moreover, sequence specific
ribozymes (see, for example, U.S. Patent No. 5,498,531) can be used to score for
the presence of specific mutations by development or loss of a ribozyme cleavage site.
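
The fragment-length comparison can be illustrated in silico. The sketch below is a
simplification under stated assumptions: it treats the DNA as a linear single strand given
as a string, cuts at every exact occurrence of one recognition site, and compares the sorted
fragment lengths of sample and control. The function names and the toy sequences in the
usage note are hypothetical; the default site GAATTC with a cut after the first base
corresponds to EcoRI (G^AATTC).

```python
def fragment_lengths(sequence, site, cut_offset):
    """Sorted fragment lengths after cutting a linear sequence at each site.

    `cut_offset` is the number of bases into the site at which the cut falls.
    """
    sequence = sequence.upper()
    site = site.upper()
    cuts = []
    start = sequence.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = sequence.find(site, start + 1)
    # Fragment boundaries run from the 5' end, through each cut, to the 3' end.
    bounds = [0] + cuts + [len(sequence)]
    return sorted(b - a for a, b in zip(bounds, bounds[1:]) if b > a)


def patterns_differ(sample, control, site="GAATTC", cut_offset=1):
    """True if sample and control give different fragment-length patterns."""
    return fragment_lengths(sample, site, cut_offset) != fragment_lengths(
        control, site, cut_offset
    )
```

For instance, a control sequence containing one GAATTC site yields two fragments, while a
sample in which a point change destroys the site yields a single uncut fragment, so
`patterns_differ` reports a difference. A change that leaves every site intact would not be
detected by this comparison.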

In other embodiments, genetic mutations in 31939 can be identified by hybridizing a
sample and control nucleic acids, e.g., DNA or RNA, to two-dimensional arrays, e.g., chip
based arrays. Such arrays include a plurality of addresses, each of which is positionally
distinguishable from the other. A different probe is located at each address of the plurality.
A probe can be complementary to a region of a 31939 nucleic acid or a putative variant
(e.g., allelic variant) thereof. A probe can have one or more mismatches to a region of a
31939 nucleic acid (e.g., a destabilizing mismatch). The arrays can have a high density of
addresses, e.g., can contain hundreds or thousands of oligonucleotide probes (Cronin, M.T.
et al. (1996) Human Mutation 7: 244-255; Kozal, M.J. et al. (1996) Nature Medicine 2:
753-759). For example, genetic mutations in 31939 can be identified in two-dimensional arrays
containing light-generated DNA probes as described in Cronin, M.T. et al. supra. Briefly, a
first hybridization array of probes can be used to scan through long stretches of DNA in a
sample and control to identify base changes between the sequences by making linear arrays
of sequential overlapping probes. This step allows the identification of point mutations.
This step is followed by a second hybridization array that allows the characterization of
specific mutations by using smaller, specialized probe arrays complementary to all variants
or mutations detected. Each mutation array is composed of parallel probe sets, one
complementary to the wild-type gene and the other complementary to the mutant gene.
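
The idea of a linear array of sequential overlapping probes can be sketched as a sliding
window over the target sequence. This is an illustration only, not the probe design of
Cronin et al.: the probe length, step size and function names are assumptions, the example
sequences in the usage note are made up, and the comparison assumes a substitution (sample
and control of equal length).

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def tiling_probes(target, probe_length=5, step=1):
    """(start, probe) pairs; each probe is complementary to one target window."""
    target = target.upper()
    if probe_length > len(target):
        raise ValueError("probe_length exceeds target length")
    return [
        (start, reverse_complement(target[start:start + probe_length]))
        for start in range(0, len(target) - probe_length + 1, step)
    ]


def affected_probes(wild_type, variant, probe_length=5, step=1):
    """Start positions of windows whose probe differs between the sequences."""
    if len(wild_type) != len(variant):
        raise ValueError("this sketch handles substitutions only")
    wt = tiling_probes(wild_type, probe_length, step)
    var = tiling_probes(variant, probe_length, step)
    return [pos for (pos, a), (_, b) in zip(wt, var) if a != b]
```

With a step of one base, a single substitution changes every window that covers it, so the
run of affected start positions localizes the changed base. For example, comparing
"ACGTACGTAC" with "ACGTAGGTAC" flags the windows starting at positions 1 through 5, all of
which cover position 5.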

In yet another embodiment, any of a variety of sequencing reactions known in the art
can be used to directly sequence the 31939 gene and detect mutations by comparing the
sequence of the sample 31939 with the corresponding wild-type (control) sequence.
Automated sequencing procedures can be utilized when performing the diagnostic assays
((1995) Biotechniques 19:448), including sequencing by mass spectrometry.

Other methods for detecting mutations in the 31939 gene include methods in which
protection from cleavage agents is used to detect mismatched bases in RNA/RNA or
RNA/DNA heteroduplexes (Myers et al. (1985) Science 230:1242; Cotton et al. (1988)
Proc. Natl Acad Sci USA 85:4397; Saleeba et al. (1992) Methods Enzymol. 217:286-295).

In still another embodiment, the mismatch cleavage reaction employs one or more
proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA
mismatch repair" enzymes) in defined systems for detecting and mapping point mutations in
31939 cDNAs obtained from samples of cells. For example, the mutY enzyme of E. coli
cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells cleaves
T at G/T mismatches (Hsu et al. (1994) Carcinogenesis 15:1657-1662; U.S. Patent No.
5,459,039).

In other embodiments, alterations in electrophoretic mobility will be used to identify
mutations in 31939 genes. For example, single strand conformation polymorphism (SSCP)
may be used to detect differences in electrophoretic mobility between mutant and wild type
nucleic acids (Orita et al. (1989) Proc Natl. Acad. Sci USA 86:2766; see also Cotton (1993)
Mutat. Res. 285:125-144; and Hayashi (1992) Genet. Anal. Tech. Appl. 9:73-79).
Single-stranded DNA fragments of sample and control 31939 nucleic acids will be denatured and
allowed to renature. The secondary structure of single-stranded nucleic acids varies
according to sequence, and the resulting alteration in electrophoretic mobility enables the
detection of even a single base change. The DNA fragments may be labeled or detected
with labeled probes. The sensitivity of the assay may be enhanced by using RNA (rather
than DNA), in which the secondary structure is more sensitive to a change in sequence. In a
preferred embodiment, the subject method utilizes heteroduplex analysis to separate double
stranded heteroduplex molecules on the basis of changes in electrophoretic mobility (Keen
et al. (1991) Trends Genet 7:5).

In yet another embodiment, the movement of mutant or wild-type fragments in
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing gradient
gel electrophoresis (DGGE) (Myers et al. (1985) Nature 313:495). When DGGE is used as
the method of analysis, DNA will be modified to insure that it does not completely denature,
for example by adding a GC clamp of approximately 40 bp of high-melting GC-rich DNA
by PCR. In a further embodiment, a temperature gradient is used in place of a denaturing
gradient to identify differences in the mobility of control and sample DNA (Rosenbaum and
Reissner (1987) Biophys Chem 265:12753).

Examples of other techniques for detecting point mutations include, but are not
limited to, selective oligonucleotide hybridization, selective amplification, or selective
primer extension (Saiki et al. (1986) Nature 324:163; Saiki et al. (1989) Proc. Natl Acad.
Sci USA 86:6230). A further method of detecting point mutations is the chemical ligation of
oligonucleotides as described in Xu et al. ((2001) Nature Biotechnol. 19:148). Adjacent
oligonucleotides, one of which selectively anneals to the query site, are ligated together if
the nucleotide at the query site of the sample nucleic acid is complementary to the query
oligonucleotide; ligation can be monitored, e.g., by fluorescent dyes coupled to the
oligonucleotides.

Alternatively, allele-specific amplification technology that depends on selective PCR
amplification may be used in conjunction with the instant invention. Oligonucleotides used
as primers for specific amplification may carry the mutation of interest in the center of the
molecule (so that amplification depends on differential hybridization) (Gibbs et al. (1989)
Nucleic Acids Res. 17:2437-2448) or at the extreme 3' end of one primer where, under
appropriate conditions, mismatch can prevent or reduce polymerase extension (Prossner
(1993) Tibtech 11:238). In addition it may be desirable to introduce a novel restriction site
in the region of the mutation to create cleavage-based detection (Gasparini et al. (1992) Mol.
Cell Probes 6:1). It is anticipated that in certain embodiments amplification may also be
performed using Taq ligase for amplification (Barany (1991) Proc. Natl. Acad. Sci USA
88:189). In such cases, ligation will occur only if there is a perfect match at the 3' end of the
5' sequence, making it possible to detect the presence of a known mutation at a specific site
by looking for the presence or absence of amplification.

In another aspect, the invention features a set of oligonucleotides. The set includes a
plurality of oligonucleotides, each of which is at least partially complementary (e.g., at least
50%, 60%, 70%, 80%, 90%, 92%, 95%, 97%, 98%, or 99% complementary) to a 31939
nucleic acid.

In a preferred embodiment the set includes a first and a second oligonucleotide. The
first and second oligonucleotide can hybridize to the same or to different locations of SEQ
ID NO:1 or the complement of SEQ ID NO:1. Different locations can be different but
overlapping, or non-overlapping on the same strand. The first and second oligonucleotide
can hybridize to sites on the same or on different strands.

The set can be useful, e.g., for identifying SNP's, or identifying specific alleles of
31939. In a preferred embodiment, each oligonucleotide of the set has a different nucleotide
at an interrogation position. In one embodiment the set includes two oligonucleotides, each
complementary to a different allele at a locus, e.g., a biallelic or polymorphic locus.

In another embodiment, the set includes four oligonucleotides, each having a
different nucleotide (e.g., adenine, guanine, cytosine, or thymidine) at the interrogation
position. The interrogation position can be a SNP or the site of a mutation. In another
preferred embodiment, the oligonucleotides of the plurality are identical in sequence to one
another (except for differences in length). The oligonucleotides can be provided with
differential labels, such that an oligonucleotide that hybridizes to one allele provides a signal
that is distinguishable from an oligonucleotide that hybridizes to a second allele. In still
another embodiment, at least one of the oligonucleotides of the set has a nucleotide change
at a position in addition to a query position, e.g., a destabilizing mutation to decrease the Tm
of the oligonucleotide. In another embodiment, at least one oligonucleotide of the set has a
non-natural nucleotide, e.g., inosine. In a preferred embodiment, the oligonucleotides are
attached to a solid support, e.g., to different addresses of an array or to different beads or
nanoparticles.
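
A four-member set of this kind can be sketched as four strings that differ only at the
interrogation position. The code below is an illustration under simplifying assumptions:
oligonucleotides and the sample are written in the same orientation (so "matching" stands in
for hybridization to the complementary strand), hybridization is approximated by counting
mismatches, and the function names are hypothetical.

```python
def interrogation_set(template, position):
    """Four oligonucleotides identical except at `position` (0-based)."""
    template = template.upper()
    if not 0 <= position < len(template):
        raise IndexError("interrogation position is outside the template")
    return {
        base: template[:position] + base + template[position + 1:]
        for base in "ACGT"
    }


def best_match(oligo_set, sample):
    """Base whose oligonucleotide has the fewest mismatches to the sample."""
    sample = sample.upper()

    def mismatches(oligo):
        if len(oligo) != len(sample):
            raise ValueError("oligonucleotide and sample lengths differ")
        return sum(a != b for a, b in zip(oligo, sample))

    return min(oligo_set, key=lambda base: mismatches(oligo_set[base]))
```

Because the four members differ only at one position, the member with the fewest mismatches
indicates the base present in the sample at that position; in practice the call would rest
on distinguishable label signals rather than on string comparison.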

In a preferred embodiment the set of oligonucleotides can be used to specifically
amplify, e.g., by PCR, or detect, a 31939 nucleic acid.

The methods described herein may be performed, for example, by utilizing
pre-packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent
described herein, which may be conveniently used, e.g., in clinical settings to diagnose
patients exhibiting symptoms or family history of a disease or illness involving a 31939
gene.

Use of 31939 Molecules as Surrogate Markers

The 31939 molecules of the invention are also useful as markers of disorders or
disease states, as markers for precursors of disease states, as markers for predisposition of
disease states, as markers of drug activity, or as markers of the pharmacogenomic profile of
a subject. Using the methods described herein, the presence, absence and/or quantity of the
31939 molecules of the invention may be detected, and may be correlated with one or more
biological states in vivo. For example, the 31939 molecules of the invention may serve as
surrogate markers for one or more disorders or disease states or for conditions leading up to
disease states. As used herein, a "surrogate marker" is an objective biochemical marker
which correlates with the absence or presence of a disease or disorder, or with the
progression of a disease or disorder (e.g., with the presence or absence of a tumor). The
presence or quantity of such markers is independent of the disease. Therefore, these
markers may serve to indicate whether a particular course of treatment is effective in
lessening a disease state or disorder. Surrogate markers are of particular use when the
presence or extent of a disease state or disorder is difficult to assess through standard
methodologies (e.g., early stage tumors), or when an assessment of disease progression is
desired before a potentially dangerous clinical endpoint is reached (e.g., an assessment of
cardiovascular disease may be made using cholesterol levels as a surrogate marker, and an
analysis of HIV infection may be made using HIV RNA levels as a surrogate marker, well in
advance of the undesirable clinical outcomes of myocardial infarction or fully-developed
AIDS). Examples of the use of surrogate markers in the art include: Koomen et al. (2000) J.
Mass. Spectrom. 35: 258-264; and James (1994) AIDS Treatment News Archive 209.

The 31939 molecules of the invention are also useful as pharmacodynamic markers.
As used herein, a "pharmacodynamic marker" is an objective biochemical marker which
correlates specifically with drug effects. The presence or quantity of a pharmacodynamic
marker is not related to the disease state or disorder for which the drug is being
administered; therefore, the presence or quantity of the marker is indicative of the presence
or activity of the drug in a subject. For example, a pharmacodynamic marker may be
indicative of the concentration of the drug in a biological tissue, in that the marker is either
expressed or transcribed or not expressed or transcribed in that tissue in relationship to the
level of the drug. In this fashion, the distribution or uptake of the drug may be monitored by
the pharmacodynamic marker. Similarly, the presence or quantity of the pharmacodynamic
marker may be related to the presence or quantity of the metabolic product of a drug, such
that the presence or quantity of the marker is indicative of the relative breakdown rate of the
drug in vivo. Pharmacodynamic markers are of particular use in increasing the sensitivity of
detection of drug effects, particularly when the drug is administered in low doses. Since
even a small amount of a drug may be sufficient to activate multiple rounds of marker (e.g.,
a 31939 marker) transcription or expression, the amplified marker may be in a quantity
which is more readily detectable than the drug itself. Also, the marker may be more easily
detected due to the nature of the marker itself; for example, using the methods described
herein, anti-31939 antibodies may be employed in an immune-based detection system for a
31939 protein marker, or 31939-specific radiolabeled probes may be used to detect a 31939
mRNA marker. Furthermore, the use of a pharmacodynamic marker may offer mechanism-based
prediction of risk due to drug treatment beyond the range of possible direct
observations. Examples of the use of pharmacodynamic markers in the art include: Matsuda
et al. US 6,033,862; Hattis et al. (1991) Env. Health Perspect. 90: 229-238; Schentag (1999)
Am. J. Health-Syst. Pharm. 56 Suppl. 3: S21-S24; and Nicolau (1999) Am. J. Health-Syst.
Pharm. 56 Suppl. 3: S16-S20.
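One way to make "correlates specifically with drug effects" concrete is to compute a correlation coefficient between measured drug levels and the marker signal. The sketch below is only an illustration: the specification does not prescribe any statistic, the function name `pearson_r` is hypothetical, and the readings are made-up numbers rather than data for 31939.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of cross-deviations and of squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical readings (arbitrary units): tissue drug concentration and
# the corresponding marker signal (e.g., an mRNA or protein measurement).
drug_conc = [0.0, 0.5, 1.0, 2.0, 4.0]
marker_signal = [1.0, 2.0, 3.0, 5.0, 9.0]

r = pearson_r(drug_conc, marker_signal)
print(f"Pearson r between drug level and marker signal: {r:.3f}")
```

A coefficient near +1 or -1 would be consistent with a marker that tracks the drug level; a value near 0 would not. A real assessment would also need controls showing that the marker does not track the disease state.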

The 31939 molecules of the invention are also useful as pharmacogenomic markers.
As used herein, a "pharmacogenomic marker" is an objective biochemical marker which
correlates with a specific clinical drug response or susceptibility in a subject (see, e.g.,
McLeod et al. (1999) Eur. J. Cancer 35:1650-1652). The presence or quantity of the
pharmacogenomic marker is related to the predicted response of the subject to a specific
drug or class of drugs prior to administration of the drug. By assessing the presence or
quantity of one or more pharmacogenomic markers in a subject, a drug therapy which is
most appropriate for the subject, or which is predicted to have a greater degree of success,
may be selected. For example, based on the presence or quantity of RNA, or protein (e.g.,
31939 protein or RNA) for specific tumor markers in a subject, a drug or course of treatment
may be selected that is optimized for the treatment of the specific tumor likely to be present
in the subject. Similarly, the presence or absence of a specific sequence mutation in 31939
DNA may correlate with 31939 drug response. The use of pharmacogenomic markers therefore
permits the application of the most appropriate treatment for each subject without having to
administer the therapy.

Pharmaceutical Compositions 

The nucleic acid and polypeptides, fragments thereof, as well as anti-31939
antibodies (also referred to herein as "active compounds") of the invention can be
incorporated into pharmaceutical compositions. Such compositions typically include the
nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable carrier. As
used herein the language "pharmaceutically acceptable carrier" includes solvents, dispersion
media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents,
and the like, compatible with pharmaceutical administration. Supplementary active
compounds can also be incorporated into the compositions.

A pharmaceutical composition is formulated to be compatible with its intended route
of administration. Examples of routes of administration include parenteral, e.g.,
intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal (topical),
transmucosal, and rectal administration. Solutions or suspensions used for parenteral,
intradermal, or subcutaneous application can include the following components: a sterile
diluent such as water for injection, saline solution, fixed oils, polyethylene glycols,
glycerine, propylene glycol or other synthetic solvents; antibacterial agents such as benzyl
alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating
agents such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or
phosphates and agents for the adjustment of tonicity such as sodium chloride or dextrose.
pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The
parenteral preparation can be enclosed in ampoules, disposable syringes or multiple dose
vials made of glass or plastic.

Pharmaceutical compositions suitable for injectable use include sterile aqueous
solutions (where water soluble) or dispersions and sterile powders for the extemporaneous
preparation of sterile injectable solutions or dispersion. For intravenous administration,
suitable carriers include physiological saline, bacteriostatic water, Cremophor EL™ (BASF,
Parsippany, NJ) or phosphate buffered saline (PBS). In all cases, the composition must be
sterile and should be fluid to the extent that easy syringability exists. It should be stable
under the conditions of manufacture and storage and must be preserved against the
contaminating action of microorganisms such as bacteria and fungi. The carrier can be a
solvent or dispersion medium containing, for example, water, ethanol, polyol (for example,
glycerol, propylene glycol, and liquid polyethylene glycol, and the like), and suitable
mixtures thereof. The proper fluidity can be maintained, for example, by the use of a
coating such as lecithin, by the maintenance of the required particle size in the case of
dispersion and by the use of surfactants. Prevention of the action of microorganisms can be
achieved by various antibacterial and antifungal agents, for example, parabens,
chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, it will be
preferable to include isotonic agents, for example, sugars, polyalcohols such as mannitol,
sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable
compositions can be brought about by including in the composition an agent which delays
absorption, for example, aluminum monostearate and gelatin.

Sterile injectable solutions can be prepared by incorporating the active compound in
the required amount in an appropriate solvent with one or a combination of ingredients
enumerated above, as required, followed by filtered sterilization. Generally, dispersions are
prepared by incorporating the active compound into a sterile vehicle which contains a basic
dispersion medium and the required other ingredients from those enumerated above. In the
case of sterile powders for the preparation of sterile injectable solutions, the preferred
methods of preparation are vacuum drying and freeze-drying which yields a powder of the
active ingredient plus any additional desired ingredient from a previously sterile-filtered
solution thereof.

Oral compositions generally include an inert diluent or an edible carrier. For the
purpose of oral therapeutic administration, the active compound can be incorporated with
excipients and used in the form of tablets, troches, or capsules, e.g., gelatin capsules. Oral
compositions can also be prepared using a fluid carrier for use as a mouthwash.
Pharmaceutically compatible binding agents, and/or adjuvant materials can be included as
part of the composition. The tablets, pills, capsules, troches and the like can contain any of
the following ingredients, or compounds of a similar nature: a binder such as
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose,
a disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant such as
magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening
agent such as sucrose or saccharin; or a flavoring agent such as peppermint, methyl
salicylate, or orange flavoring.

For administration by inhalation, the compounds are delivered in the form of an
aerosol spray from a pressured container or dispenser which contains a suitable propellant,
e.g., a gas such as carbon dioxide, or a nebulizer.

Systemic administration can also be by transmucosal or transdermal means. For
transmucosal or transdermal administration, penetrants appropriate to the barrier to be
permeated are used in the formulation. Such penetrants are generally known in the art, and
include, for example, for transmucosal administration, detergents, bile salts, and fusidic acid
derivatives. Transmucosal administration can be accomplished through the use of nasal
sprays or suppositories. For transdermal administration, the active compounds are
formulated into ointments, salves, gels, or creams as generally known in the art.

The compounds can also be prepared in the form of suppositories (e.g., with
conventional suppository bases such as cocoa butter and other glycerides) or retention
enemas for rectal delivery.

In one embodiment, the active compounds are prepared with carriers that will protect
the compound against rapid elimination from the body, such as a controlled release
formulation, including implants and microencapsulated delivery systems. Biodegradable,
biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides,
polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation of
such formulations will be apparent to those skilled in the art. The materials can also be
obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal
suspensions (including liposomes targeted to infected cells with monoclonal antibodies to
viral antigens) can also be used as pharmaceutically acceptable carriers. These can be
prepared according to methods known to those skilled in the art, for example, as described
in U.S. Patent No. 4,522,811.

It is advantageous to formulate oral or parenteral compositions in dosage unit form
for ease of administration and uniformity of dosage. Dosage unit form as used herein refers
to physically discrete units suited as unitary dosages for the subject to be treated; each unit
containing a predetermined quantity of active compound calculated to produce the desired
therapeutic effect in association with the required pharmaceutical carrier.

Toxicity and therapeutic efficacy of such compounds can be determined by standard
pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the
LD50 (the dose lethal to 50% of the population) and the ED50 (the dose therapeutically
effective in 50% of the population). The dose ratio between toxic and therapeutic effects is
the therapeutic index and it can be expressed as the ratio LD50/ED50. Compounds which
exhibit high therapeutic indices are preferred. While compounds that exhibit toxic side
effects may be used, care should be taken to design a delivery system that targets such
compounds to the site of affected tissue in order to minimize potential damage to uninfected
cells and, thereby, reduce side effects.
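The ratio LD50/ED50 described above can be written out as a short calculation. The following is a minimal sketch only; the function name `therapeutic_index` and the example values are hypothetical and are not taken from the specification.

```python
def therapeutic_index(ld50, ed50):
    """Return the therapeutic index, i.e. the ratio LD50/ED50.

    Both doses must be positive and expressed in the same units
    (for example mg/kg), so that the ratio is dimensionless.
    """
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical compound: LD50 = 200 mg/kg, ED50 = 10 mg/kg.
ti = therapeutic_index(200.0, 10.0)
print(f"Therapeutic index: {ti:.1f}")
```

A larger value means a wider margin between the effective dose and the lethal dose, which is why compounds with high therapeutic indices are preferred.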

The data obtained from the cell culture assays and animal studies can be used in
formulating a range of dosage for use in humans. The dosage of such compounds lies
preferably within a range of circulating concentrations that include the ED50 with little or
no toxicity. The dosage may vary within this range depending upon the dosage form
employed and the route of administration utilized. For any compound used in the method of
the invention, the therapeutically effective dose can be estimated initially from cell culture
assays. A dose may be formulated in animal models to achieve a circulating plasma
concentration range that includes the IC50 (i.e., the concentration of the test compound
which achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such
information can be used to more accurately determine useful doses in humans. Levels in
plasma may be measured, for example, by high performance liquid chromatography.
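As an illustration of how an IC50 might be read off cell culture data, the sketch below interpolates on a log-concentration scale between the two measured points that bracket 50% inhibition. This is an assumed approach, not one the specification describes: the function name `estimate_ic50` is hypothetical, the readings are invented, and inhibition is assumed to be expressed as a percentage of the maximal effect.

```python
import math


def estimate_ic50(concentrations, inhibitions):
    """Estimate IC50 by log-linear interpolation.

    concentrations: positive test-compound concentrations (any one unit).
    inhibitions:    percent inhibition (0-100) observed at each concentration.
    Returns the concentration at which interpolated inhibition equals 50%.
    """
    if any(c <= 0 for c in concentrations):
        raise ValueError("concentrations must be positive")
    points = sorted(zip(concentrations, inhibitions))
    # Walk adjacent pairs and find the pair that brackets 50% inhibition.
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo <= 50.0 <= i_hi and i_lo != i_hi:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_lo, log_hi = math.log10(c_lo), math.log10(c_hi)
            return 10 ** (log_lo + frac * (log_hi - log_lo))
    raise ValueError("50% inhibition is not bracketed by the data")


# Hypothetical dose-response readings (concentration in micromolar).
conc_um = [0.1, 1.0, 100.0]
inhib_pct = [5.0, 20.0, 80.0]
ic50 = estimate_ic50(conc_um, inhib_pct)
print(f"Estimated IC50: {ic50:.2f} micromolar")
```

In practice a full sigmoidal curve fit would normally be used; the interpolation here is only meant to show the idea of a half-maximal concentration.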

As defined herein, a therapeutically effective amount of protein or polypeptide (i.e.,
an effective dosage) ranges from about 0.001 to 30 mg/kg body weight, preferably about
0.01 to 25 mg/kg body weight, more preferably about 0.1 to 20 mg/kg body weight, and
even more preferably about 1 to 10 mg/kg, 2 to 9 mg/kg, 3 to 8 mg/kg, 4 to 7 mg/kg, or 5 to
6 mg/kg body weight. The protein or polypeptide can be administered one time per week
for between about 1 to 10 weeks, preferably between 2 to 8 weeks, more preferably between
about 3 to 7 weeks, and even more preferably for about 4, 5, or 6 weeks. The skilled artisan
will appreciate that certain factors may influence the dosage and timing required to
effectively treat a subject, including but not limited to the severity of the disease or disorder,
previous treatments, the general health and/or age of the subject, and other diseases present.
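The per-kilogram ranges above translate into absolute amounts by simple multiplication with body weight. The sketch below shows only that arithmetic; the function name `dose_range_mg` and the 70 kg example subject are assumptions made for illustration, and the calculation ignores the subject-specific factors the paragraph lists.

```python
def dose_range_mg(body_weight_kg, low_mg_per_kg, high_mg_per_kg):
    """Convert a mg/kg dosage range into an absolute range in mg."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    if not 0 <= low_mg_per_kg <= high_mg_per_kg:
        raise ValueError("expected 0 <= low <= high")
    return (body_weight_kg * low_mg_per_kg, body_weight_kg * high_mg_per_kg)


# Hypothetical 70 kg subject and the "about 1 to 10 mg/kg" range from the text.
low_mg, high_mg = dose_range_mg(70.0, 1.0, 10.0)
print(f"Per-administration range: {low_mg:.0f} to {high_mg:.0f} mg")

# One administration per week for 4 weeks, as in one of the stated schedules.
weeks = 4
print(f"Cumulative range over {weeks} weeks: "
      f"{low_mg * weeks:.0f} to {high_mg * weeks:.0f} mg")
```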

Moreover, treatment of a subject with a therapeutically effective amount of a protein,
polypeptide, or antibody can include a single treatment or, preferably, can include a series of
treatments.

For antibodies, the preferred dosage is 0.1 mg/kg of body weight (generally 10
mg/kg to 20 mg/kg). If the antibody is to act in the brain, a dosage of 50 mg/kg to 100
mg/kg is usually appropriate. Generally, partially human antibodies and fully human
antibodies have a longer half-life within the human body than other antibodies.
Accordingly, lower dosages and less frequent administration are often possible.
Modifications such as lipidation can be used to stabilize antibodies and to enhance uptake
and tissue penetration (e.g., into the brain). A method for lipidation of antibodies is
described by Cruikshank et al. ((1997) J. Acquired Immune Deficiency Syndromes and
Human Retrovirology 14:193).

The present invention encompasses agents which modulate expression or activity.
An agent may, for example, be a small molecule. For example, such small molecules
include, but are not limited to, peptides, peptidomimetics (e.g., peptoids), amino acids,
amino acid analogs, polynucleotides, polynucleotide analogs, nucleotides, nucleotide
analogs, organic or inorganic compounds (i.e., including heteroorganic and organometallic
compounds) having a molecular weight less than about 10,000 grams per mole, organic or
inorganic compounds having a molecular weight less than about 5,000 grams per mole,
organic or inorganic compounds having a molecular weight less than about 1,000 grams per
mole, organic or inorganic compounds having a molecular weight less than about 500 grams
per mole, and salts, esters, and other pharmaceutically acceptable forms of such compounds.
Exemplary doses include milligram or microgram amounts of the small molecule per
kilogram of subject or sample weight (e.g., about 1 microgram per kilogram to about 500
milligrams per kilogram, about 100 micrograms per kilogram to about 5 milligrams per
kilogram, or about 1 microgram per kilogram to about 50 micrograms per kilogram). It is
furthermore understood that appropriate doses of a small molecule depend upon the potency
of the small molecule with respect to the expression or activity to be modulated. When one
or more of these small molecules is to be administered to an animal (e.g., a human) in order
to modulate expression or activity of a polypeptide or nucleic acid of the invention, a
physician, veterinarian, or researcher may, for example, prescribe a relatively low dose at
first, subsequently increasing the dose until an appropriate response is obtained. In addition,
it is understood that the specific dose level for any particular animal subject will depend
upon a variety of factors including the activity of the specific compound employed, the age,
body weight, general health, gender, and diet of the subject, the time of administration, the
route of administration, the rate of excretion, any drug combination, and the degree of
expression or activity to be modulated.

An antibody (or fragment thereof) may be conjugated to a therapeutic moiety such as
a cytotoxin, a therapeutic agent or a radioactive metal ion. A cytotoxin or cytotoxic agent
includes any agent that is detrimental to cells. Examples include taxol, cytochalasin B,
gramicidin D, ethidium bromide, emetine, mitomycin, etoposide, teniposide, vincristine,
vinblastine, colchicine, doxorubicin, daunorubicin, dihydroxy anthracin dione, mitoxantrone,
mithramycin, actinomycin D, 1-dehydrotestosterone, glucocorticoids, procaine, tetracaine,
lidocaine, propranolol, and puromycin and analogs or homologs thereof. Therapeutic agents
include, but are not limited to, antimetabolites (e.g., methotrexate, 6-mercaptopurine,
6-thioguanine, cytarabine, 5-fluorouracil, dacarbazine), alkylating agents (e.g.,
mechlorethamine, thiotepa, chlorambucil, melphalan, carmustine (BCNU) and lomustine
(CCNU), cyclophosphamide, busulfan, dibromomannitol, streptozotocin, mitomycin C, and
cis-dichlorodiamine platinum (II) (DDP) cisplatin), anthracyclines (e.g., daunorubicin
(formerly daunomycin) and doxorubicin), antibiotics (e.g., dactinomycin (formerly
actinomycin), bleomycin, mithramycin, and anthramycin (AMC)), and anti-mitotic agents
(e.g., vincristine and vinblastine).

The conjugates of the invention can be used for modifying a given biological
response; the drug moiety is not to be construed as limited to classical chemical therapeutic
agents. For example, the drug moiety may be a protein or polypeptide possessing a
biological activity. Such proteins may include, for example, a toxin such as abrin, ricin A,
pseudomonas exotoxin, or diphtheria toxin; a protein such as tumor necrosis factor,
α-interferon, β-interferon, nerve growth factor, platelet derived growth factor, tissue
plasminogen activator; or biological response modifiers such as, for example, lymphokines,
interleukin-1 ("IL-1"), interleukin-2 ("IL-2"), interleukin-6 ("IL-6"), granulocyte
macrophage colony stimulating factor ("GM-CSF"), granulocyte colony stimulating factor
("G-CSF"), or other growth factors.

Alternatively, an antibody can be conjugated to a second antibody to form an
antibody heteroconjugate as described by Segal in U.S. Patent No. 4,676,980.

The nucleic acid molecules of the invention can be inserted into vectors and used as
gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for example,
intravenous injection, local administration (see U.S. Patent 5,328,470) or by stereotactic
injection (see e.g., Chen et al. (1994) Proc. Natl. Acad. Sci. USA 91:3054-3057). The
pharmaceutical preparation of the gene therapy vector can include the gene therapy vector in
an acceptable diluent, or can comprise a slow release matrix in which the gene delivery
vehicle is imbedded. Alternatively, where the complete gene delivery vector can be
produced intact from recombinant cells, e.g., retroviral vectors, the pharmaceutical
preparation can include one or more cells which produce the gene delivery system.

The pharmaceutical compositions can be included in a container, pack, or dispenser 
together with instructions for administration. 

Methods of Treatment

The present invention provides for both prophylactic and therapeutic methods of
treating a subject at risk of (or susceptible to) a disorder or having a disorder associated with
aberrant or unwanted 31939 expression or activity. As used herein, the term "treatment" is
defined as the application or administration of a therapeutic agent to a patient, or application
or administration of a therapeutic agent to an isolated tissue or cell line from a patient, who
has a disease, a symptom of disease or a predisposition toward a disease, with the purpose to
cure, heal, alleviate, relieve, alter, remedy, ameliorate, improve or affect the disease, the
symptoms of disease or the predisposition toward disease. A therapeutic agent includes, but
is not limited to, small molecules, peptides, antibodies, ribozymes and antisense
oligonucleotides.

With regard to both prophylactic and therapeutic methods of treatment, such
treatments may be specifically tailored or modified, based on knowledge obtained from the
field of pharmacogenomics. "Pharmacogenomics", as used herein, refers to the application
of genomics technologies such as gene sequencing, statistical genetics, and gene expression
analysis to drugs in clinical development and on the market. More specifically, the term
refers to the study of how a patient's genes determine his or her response to a drug (e.g., a
patient's "drug response phenotype", or "drug response genotype"). Thus, another aspect of
the invention provides methods for tailoring an individual's prophylactic or therapeutic
treatment with either the 31939 molecules of the present invention or 31939 modulators
according to that individual's drug response genotype. Pharmacogenomics allows a
clinician or physician to target prophylactic or therapeutic treatments to patients who will
most benefit from the treatment and to avoid treatment of patients who will experience toxic
drug-related side effects.

In one aspect, the invention provides a method for preventing in a subject, a disease
or condition associated with an aberrant or unwanted 31939 expression or activity, by
administering to the subject a 31939 or an agent which modulates 31939 expression or at
least one 31939 activity. Subjects at risk for a disease which is caused or contributed to by
aberrant or unwanted 31939 expression or activity can be identified by, for example, any or
a combination of diagnostic or prognostic assays as described herein. Administration of a
prophylactic agent can occur prior to the manifestation of symptoms characteristic of the
31939 aberrance, such that a disease or disorder is prevented or, alternatively, delayed in its
progression. Depending on the type of 31939 aberrance, for example, a 31939, 31939
agonist or 31939 antagonist agent can be used for treating the subject. The appropriate
agent can be determined based on screening assays described herein.

It is possible that some 31939 disorders can be caused, at least in part, by an
abnormal level of gene product or by the presence of a gene product exhibiting abnormal
activity. As such, the reduction in the level and/or activity of such gene products would
bring about the amelioration of disorder symptoms.

The 31939 molecules can act as novel diagnostic targets and therapeutic agents for
controlling one or more of cellular proliferative and/or differentiative disorders and
neurological disorders as described above, as well as disorders associated with bone
metabolism, immune disorders, cardiovascular disorders, liver disorders, viral diseases, pain
or metabolic disorders.

Aberrant expression and/or activity of 31939 molecules may mediate disorders
associated with bone metabolism. "Bone metabolism" refers to direct or indirect effects in
the formation or degeneration of bone structures, e.g., bone formation, bone resorption, etc.,
which may ultimately affect the concentrations in serum of calcium and phosphate. This
term also includes activities mediated by 31939 molecules effects in bone cells, e.g.
osteoclasts and osteoblasts, that may in turn result in bone formation and degeneration. For
example, 31939 molecules may support different activities of bone resorbing osteoclasts
such as the stimulation of differentiation of monocytes and mononuclear phagocytes into
osteoclasts. Accordingly, 31939 molecules that modulate the production of bone cells can
influence bone formation and degeneration, and thus may be used to treat bone disorders.
Examples of such disorders include, but are not limited to, osteoporosis, osteodystrophy,
osteomalacia, rickets, osteitis fibrosa cystica, renal osteodystrophy, osteosclerosis,
anti-convulsant treatment, osteopenia, fibrogenesis-imperfecta ossium, secondary
hyperparathyroidism, hypoparathyroidism, hyperparathyroidism, cirrhosis, obstructive
jaundice, drug induced metabolism, medullary carcinoma, chronic renal disease, rickets,
sarcoidosis, glucocorticoid antagonism, malabsorption syndrome, steatorrhea, tropical sprue,
idiopathic hypercalcemia and milk fever.

The 31939 nucleic acid and protein of the invention can be used to treat and/or
diagnose a variety of immune disorders. Examples of immune disorders or diseases include,
but are not limited to, autoimmune diseases (including, for example, diabetes mellitus,
arthritis (including rheumatoid arthritis, juvenile rheumatoid arthritis, osteoarthritis, psoriatic
arthritis), multiple sclerosis, encephalomyelitis, myasthenia gravis, systemic lupus
erythematosus, autoimmune thyroiditis, dermatitis (including atopic dermatitis and
eczematous dermatitis), psoriasis, Sjögren's Syndrome, Crohn's disease, aphthous ulcer,
iritis, conjunctivitis, keratoconjunctivitis, ulcerative colitis, asthma, allergic asthma,
cutaneous lupus erythematosus, scleroderma, vaginitis, proctitis, drug eruptions, leprosy
reversal reactions, erythema nodosum leprosum, autoimmune uveitis, allergic
encephalomyelitis, acute necrotizing hemorrhagic encephalopathy, idiopathic bilateral
progressive sensorineural hearing loss, aplastic anemia, pure red cell anemia, idiopathic
thrombocytopenia, polychondritis, Wegener's granulomatosis, chronic active hepatitis,
Stevens-Johnson syndrome, idiopathic sprue, lichen planus, Graves' disease, sarcoidosis,
primary biliary cirrhosis, uveitis posterior, and interstitial lung fibrosis), graft-versus-host
disease, cases of transplantation, and allergy such as, atopic allergy.

Examples of disorders involving the heart or "cardiovascular disorder" include, but
are not limited to, a disease, disorder, or state involving the cardiovascular system, e.g., the
heart, the blood vessels, and/or the blood. A cardiovascular disorder can be caused by an
imbalance in arterial pressure, a malfunction of the heart or an occlusion of a blood vessel,
e.g., by a thrombus. Examples of such disorders include hypertension, atherosclerosis,
coronary artery spasm, congestive heart failure, coronary artery disease, valvular disease,
arrhythmias, and cardiomyopathies.

Disorders which may be treated or diagnosed by methods described herein include,
but are not limited to, disorders associated with an accumulation in the liver of fibrous
tissue, such as that resulting from an imbalance between production and degradation of the
extracellular matrix accompanied by the collapse and condensation of preexisting fibers.
The methods described herein can be used to diagnose or treat hepatocellular necrosis or
injury induced by a wide variety of agents including processes which disturb homeostasis,
such as an inflammatory process, tissue damage resulting from toxic injury or altered
hepatic blood flow, and infections (e.g., bacterial, viral and parasitic). For example, the
methods can be used for the early detection of hepatic injury, such as portal hypertension or
hepatic fibrosis. In addition, the methods can be employed to detect liver fibrosis attributed
to inborn errors of metabolism, for example, fibrosis resulting from a storage disorder such
as Gaucher's disease (lipid abnormalities) or a glycogen storage disease, A1-antitrypsin
deficiency; a disorder mediating the accumulation (e.g., storage) of an exogenous
substance, for example, hemochromatosis (iron-overload syndrome) and copper storage
diseases (Wilson's disease), disorders resulting in the accumulation of a toxic metabolite
(e.g., tyrosinemia, fructosemia and galactosemia) and peroxisomal disorders (e.g., Zellweger
syndrome). Additionally, the methods described herein may be useful for the early detection
and treatment of liver injury associated with the administration of various chemicals or
drugs, such as for example, methotrexate, isoniazid, oxyphenisatin, methyldopa,
chlorpromazine, tolbutamide or alcohol, or which represents a hepatic manifestation of a
vascular disorder such as obstruction of either the intrahepatic or extrahepatic bile flow or an
alteration in hepatic circulation resulting, for example, from chronic heart failure,
veno-occlusive disease, portal vein thrombosis or Budd-Chiari syndrome.

Additionally, 31939 molecules may play an important role in the etiology of certain
viral diseases, including but not limited to Hepatitis B, Hepatitis C and Herpes Simplex
Virus (HSV). Modulators of 31939 activity could be used to control viral diseases. The
modulators can be used in the treatment and/or diagnosis of viral infected tissue or
virus-associated tissue fibrosis, especially liver and liver fibrosis. Also, 31939 modulators can be
used in the treatment and/or diagnosis of virus-associated carcinoma, especially
hepatocellular cancer.

Additionally, 31939 may play an important role in the regulation of metabolism or
pain disorders. Diseases of metabolic imbalance include, but are not limited to, obesity,
anorexia nervosa, cachexia, lipid disorders, and diabetes. Examples of pain disorders
include, but are not limited to, pain response elicited during various forms of tissue injury,
e.g., inflammation, infection, and ischemia, usually referred to as hyperalgesia (described in,
for example, Fields, H.L. (1987) Pain, New York: McGraw-Hill); pain associated with
musculoskeletal disorders, e.g., joint pain; tooth pain; headaches; pain associated with
surgery; pain related to irritable bowel syndrome; or chest pain.

As discussed, successful treatment of 31939 disorders can be brought about by
techniques that serve to inhibit the expression or activity of target gene products. For
example, compounds, e.g., an agent identified using an assay described above, that prove
to exhibit negative modulatory activity, can be used in accordance with the invention to
prevent and/or ameliorate symptoms of 31939 disorders. Such molecules can include, but
are not limited to peptides, phosphopeptides, small organic or inorganic molecules, or
antibodies (including, for example, polyclonal, monoclonal, humanized, anti-idiotypic,
chimeric or single chain antibodies, and Fab, F(ab')2 and Fab expression library fragments,
scFV molecules, and epitope-binding fragments thereof).

Further, antisense and ribozyme molecules that inhibit expression of the target gene
can also be used in accordance with the invention to reduce the level of target gene
expression, thus effectively reducing the level of target gene activity. Still further, triple
helix molecules can be utilized in reducing the level of target gene activity. Antisense,
ribozyme and triple helix molecules are discussed above.

It is possible that the use of antisense, ribozyme, and/or triple helix molecules to
reduce or inhibit mutant gene expression can also reduce or inhibit the transcription (triple
helix) and/or translation (antisense, ribozyme) of mRNA produced by normal target gene
alleles, such that the concentration of normal target gene product present can be lower than
is necessary for a normal phenotype. In such cases, nucleic acid molecules that encode and
express target gene polypeptides exhibiting normal target gene activity can be introduced
into cells via gene therapy method. Alternatively, in instances in which the target gene
encodes an extracellular protein, it can be preferable to co-administer normal target gene
protein into the cell or tissue in order to maintain the requisite level of cellular or tissue
target gene activity.

Another method by which nucleic acid molecules may be utilized in treating or
preventing a disease characterized by 31939 expression is through the use of aptamer
molecules specific for 31939 protein. Aptamers are nucleic acid molecules having a tertiary
structure which permits them to specifically bind to protein ligands (see, e.g., Osborne, et al.
(1997) Curr. Opin. Chem. Biol. 1: 5-9; and Patel, D.J. (1997) Curr. Opin. Chem. Biol. 1:32-46).
Since nucleic acid molecules may in many cases be more conveniently introduced into
target cells than therapeutic protein molecules may be, aptamers offer a method by which
31939 protein activity may be specifically decreased without the introduction of drugs or
other molecules which may have pluripotent effects.

Antibodies can be generated that are both specific for target gene product and that
reduce target gene product activity. Such antibodies may, therefore, be administered in
instances whereby negative modulatory techniques are appropriate for the treatment of
31939 disorders. For a description of antibodies, see the Antibody section above.

In circumstances wherein injection of an animal or a human subject with a 31939
protein or epitope for stimulating antibody production is harmful to the subject, it is possible
to generate an immune response against 31939 through the use of anti-idiotypic antibodies
(see, for example, Herlyn, D. (1999) Ann. Med. 31:66-78; and Bhattacharya-Chatterjee, M.,
and Foon, K.A. (1998) Cancer Treat. Res. 94:51-68). If an anti-idiotypic antibody is
introduced into a mammal or human subject, it should stimulate the production of anti-anti-
idiotypic antibodies, which should be specific to the 31939 protein. Vaccines directed to a
disease characterized by 31939 expression may also be generated in this fashion.

In instances where the target antigen is intracellular and whole antibodies are used,
internalizing antibodies may be preferred. Lipofectin or liposomes can be used to deliver
the antibody or a fragment of the Fab region that binds to the target antigen into cells.
Where fragments of the antibody are used, the smallest inhibitory fragment that binds to the
target antigen is preferred. For example, peptides having an amino acid sequence
corresponding to the Fv region of the antibody can be used. Alternatively, single chain
neutralizing antibodies that bind to intracellular target antigens can also be administered.
Such single chain antibodies can be administered, for example, by expressing nucleotide
sequences encoding single-chain antibodies within the target cell population (see e.g.,
Marasco et al. (1993) Proc. Natl. Acad. Sci. USA 90:7889-7893).

The identified compounds that inhibit target gene expression, synthesis and/or
activity can be administered to a patient at therapeutically effective doses to prevent, treat or
ameliorate 31939 disorders. A therapeutically effective dose refers to that amount of the
compound sufficient to result in amelioration of symptoms of the disorders. Toxicity and
therapeutic efficacy of such compounds can be determined by standard pharmaceutical
procedures as described above.

The data obtained from the cell culture assays and animal studies can be used in
formulating a range of dosage for use in humans. The dosage of such compounds lies
preferably within a range of circulating concentrations that include the ED50 with little or no
toxicity. The dosage can vary within this range depending upon the dosage form employed
and the route of administration utilized. For any compound used in the method of the
invention, the therapeutically effective dose can be estimated initially from cell culture
assays. A dose can be formulated in animal models to achieve a circulating plasma
concentration range that includes the IC50 (i.e., the concentration of the test compound that
achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such
information can be used to more accurately determine useful doses in humans. Levels in
plasma can be measured, for example, by high performance liquid chromatography.
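
As an illustration of how an IC50 might be read off cell culture data, the following is a minimal sketch rather than a method prescribed by this specification. It assumes a set of measured concentrations with corresponding percent-inhibition values and interpolates on a log-concentration scale between the two points that bracket 50% inhibition. The function name and the data values are hypothetical.

```python
import math

def estimate_ic50(concentrations, inhibition):
    """Estimate IC50 by log-linear interpolation between the two measured
    points that bracket 50% inhibition (a simplifying assumption; a full
    dose-response curve fit would normally be preferred)."""
    points = sorted(zip(concentrations, inhibition))
    for (c_lo, y_lo), (c_hi, y_hi) in zip(points, points[1:]):
        if y_lo <= 50.0 <= y_hi and y_hi != y_lo:
            frac = (50.0 - y_lo) / (y_hi - y_lo)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    raise ValueError("50% inhibition is not bracketed by the data")

# Hypothetical dose-response data: concentration (micromolar), % inhibition.
doses = [0.1, 1.0, 10.0, 100.0]
percent_inhibition = [5.0, 25.0, 75.0, 95.0]
ic50 = estimate_ic50(doses, percent_inhibition)
```

With these placeholder values the bracketing points are 1.0 and 10.0, so the estimate falls at their geometric midpoint.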

Another example of determination of effective dose for an individual is the ability to
directly assay levels of "free" and "bound" compound in the serum of the test subject. Such
assays may utilize antibody mimics and/or "biosensors" that have been created through
molecular imprinting techniques. The compound which is able to modulate 31939 activity is
used as a template, or "imprinting molecule", to spatially organize polymerizable monomers
prior to their polymerization with catalytic reagents. The subsequent removal of the
imprinted molecule leaves a polymer matrix which contains a repeated "negative image" of
the compound and is able to selectively rebind the molecule under biological assay
conditions. A detailed review of this technique can be seen in Ansell, R.J. et al. (1996)
Current Opinion in Biotechnology 7:89-94 and in Shea, K.J. (1994) Trends in Polymer
Science 2:166-173. Such "imprinted" affinity matrixes are amenable to ligand-binding
assays, whereby the immobilized monoclonal antibody component is replaced by an
appropriately imprinted matrix. An example of the use of such matrixes in this way can be
seen in Vlatakis, G. et al. (1993) Nature 361:645-647. Through the use of isotope-labeling,
the "free" concentration of compound which modulates the expression or activity of 31939
can be readily monitored and used in calculations of IC50.

Such "imprinted" affinity matrixes can also be designed to include fluorescent
groups whose photon-emitting properties measurably change upon local and selective
binding of target compound. These changes can be readily assayed in real time using
appropriate fiberoptic devices, in turn allowing the dose in a test subject to be quickly
optimized based on its individual IC50. A rudimentary example of such a "biosensor" is
discussed in Kriz, D. et al. (1995) Analytical Chemistry 67:2142-2144.

Another aspect of the invention pertains to methods of modulating 31939 expression
or activity for therapeutic purposes. Accordingly, in an exemplary embodiment, the
modulatory method of the invention involves contacting a cell with a 31939 or agent that
modulates one or more of the activities of 31939 protein activity associated with the cell.
An agent that modulates 31939 protein activity can be an agent as described herein, such as
a nucleic acid or a protein, a naturally-occurring target molecule of a 31939 protein (e.g., a
31939 substrate or receptor), a 31939 antibody, a 31939 agonist or antagonist, a
peptidomimetic of a 31939 agonist or antagonist, or other small molecule.

In one embodiment, the agent stimulates one or more 31939 activities. Examples of such
stimulatory agents include active 31939 protein and a nucleic acid molecule encoding
31939. In another embodiment, the agent inhibits one or more 31939 activities. Examples
of such inhibitory agents include antisense 31939 nucleic acid molecules, anti-31939
antibodies, and 31939 inhibitors. These modulatory methods can be performed in vitro
(e.g., by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the
agent to a subject). As such, the present invention provides methods of treating an
individual afflicted with a disease or disorder characterized by aberrant or unwanted
expression or activity of a 31939 protein or nucleic acid molecule. In one embodiment, the
method involves administering an agent (e.g., an agent identified by a screening assay
described herein), or combination of agents that modulates (e.g., up regulates or down
regulates) 31939 expression or activity. In another embodiment, the method involves
administering a 31939 protein or nucleic acid molecule as therapy to compensate for
reduced, aberrant, or unwanted 31939 expression or activity.

Stimulation of 31939 activity is desirable in situations in which 31939 is abnormally
downregulated and/or in which increased 31939 activity is likely to have a beneficial effect.
For example, stimulation of 31939 activity is desirable in situations in which a 31939 is
downregulated and/or in which increased 31939 activity is likely to have a beneficial effect.
Likewise, inhibition of 31939 activity is desirable in situations in which 31939 is
abnormally upregulated and/or in which decreased 31939 activity is likely to have a
beneficial effect.

Pharmacogenomics

The 31939 molecules of the present invention, as well as agents, or modulators
which have a stimulatory or inhibitory effect on 31939 activity (e.g., 31939 gene
expression) as identified by a screening assay described herein can be administered to
individuals to treat (prophylactically or therapeutically) 31939 associated disorders (e.g., a
cell proliferative or differentiative disorder, e.g., cancer, or a neuronal disorder) associated
with aberrant or unwanted 31939 activity. In conjunction with such treatment,
pharmacogenomics (i.e., the study of the relationship between an individual's genotype and
that individual's response to a foreign compound or drug) may be considered. Differences in
metabolism of therapeutics can lead to severe toxicity or therapeutic failure by altering the
relation between dose and blood concentration of the pharmacologically active drug. Thus,
a physician or clinician may consider applying knowledge obtained in relevant
pharmacogenomics studies in determining whether to administer a 31939 molecule or 31939
modulator as well as tailoring the dosage and/or therapeutic regimen of treatment with a
31939 molecule or 31939 modulator.

Pharmacogenomics deals with clinically significant hereditary variations in the
response to drugs due to altered drug disposition and abnormal action in affected persons.
See, for example, Eichelbaum, M. et al. (1996) Clin. Exp. Pharmacol. Physiol. 23:983-985
and Linder, M.W. et al. (1997) Clin. Chem. 43:254-266. In general, two types of
pharmacogenetic conditions can be differentiated: genetic conditions transmitted as a
single factor altering the way drugs act on the body (altered drug action), or genetic
conditions transmitted as single factors altering the way the body acts on drugs (altered drug
metabolism). These pharmacogenetic conditions can occur either as rare genetic defects or
as naturally-occurring polymorphisms. For example, glucose-6-phosphate dehydrogenase
deficiency (G6PD) is a common inherited enzymopathy in which the main clinical
complication is haemolysis after ingestion of oxidant drugs (anti-malarials, sulfonamides,
analgesics, nitrofurans) and consumption of fava beans.

One pharmacogenomics approach to identifying genes that predict drug response,
known as "a genome-wide association," relies primarily on a high-resolution map of the
human genome consisting of already known gene-related markers (e.g., a "bi-allelic" gene
marker map which consists of 60,000-100,000 polymorphic or variable sites on the human
genome, each of which has two variants). Such a high-resolution genetic map can be
compared to a map of the genome of each of a statistically significant number of patients
taking part in a Phase II/III drug trial to identify markers associated with a particular
observed drug response or side effect. Alternatively, such a high resolution map can be
generated from a combination of some ten-million known single nucleotide polymorphisms
(SNPs) in the human genome. As used herein, a "SNP" is a common alteration that occurs
in a single nucleotide base in a stretch of DNA. For example, a SNP may occur once per
every 1000 bases of DNA. A SNP may be involved in a disease process; however, the vast
majority may not be disease-associated. Given a genetic map based on the occurrence of
such SNPs, individuals can be grouped into genetic categories depending on a particular
pattern of SNPs in their individual genome. In such a manner, treatment regimens can be
tailored to groups of genetically similar individuals, taking into account traits that may be
common among such genetically similar individuals.
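
The grouping of individuals by SNP pattern described above can be sketched in a few lines. This is an illustrative sketch only: the marker names, patient identifiers, and alleles are hypothetical placeholders, and a real analysis would involve far more markers and statistical testing.

```python
from collections import defaultdict

def group_by_snp_pattern(genotypes, marker_ids):
    """Group individuals whose alleles agree at every listed marker.

    genotypes: mapping of individual -> {marker id -> allele}
    marker_ids: ordered list of markers that define the pattern
    """
    groups = defaultdict(list)
    for individual, alleles in genotypes.items():
        pattern = tuple(alleles[m] for m in marker_ids)
        groups[pattern].append(individual)
    return dict(groups)

# Hypothetical genotype calls at two hypothetical markers.
genotypes = {
    "patient_1": {"snp_a": "A", "snp_b": "G"},
    "patient_2": {"snp_a": "A", "snp_b": "G"},
    "patient_3": {"snp_a": "C", "snp_b": "G"},
}
groups = group_by_snp_pattern(genotypes, ["snp_a", "snp_b"])
```

Each key of `groups` is one SNP pattern, and the individuals listed under it form one genetic category to which a treatment regimen could be tailored.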

Alternatively, a method termed the "candidate gene approach," can be utilized to
identify genes that predict drug response. According to this method, if a gene that encodes a
drug's target is known (e.g., a 31939 protein of the present invention), all common variants
of that gene can be fairly easily identified in the population and it can be determined if
having one version of the gene versus another is associated with a particular drug response.

Alternatively, a method termed "gene expression profiling," can be utilized to
identify genes that predict drug response. For example, the gene expression of an animal
dosed with a drug (e.g., a 31939 molecule or 31939 modulator of the present invention) can
give an indication whether gene pathways related to toxicity have been turned on.

Information generated from more than one of the above pharmacogenomics
approaches can be used to determine appropriate dosage and treatment regimens for
prophylactic or therapeutic treatment of an individual. This knowledge, when applied to
dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus enhance
therapeutic or prophylactic efficiency when treating a subject with a 31939 molecule or
31939 modulator, such as a modulator identified by one of the exemplary screening assays
described herein.

The present invention further provides methods for identifying new agents, or
combinations, that are based on identifying agents that modulate the activity of one or more
of the gene products encoded by one or more of the 31939 genes of the present invention,
wherein these products may be associated with resistance of the cells to a therapeutic agent.
Specifically, the activity of the proteins encoded by the 31939 genes of the present invention
can be used as a basis for identifying agents for overcoming agent resistance. By blocking
the activity of one or more of the resistance proteins, target cells, e.g., human cells, will
become sensitive to treatment with an agent that the unmodified target cells were resistant
to.

Monitoring the influence of agents (e.g., drugs) on the expression or activity of a
31939 protein can be applied in clinical trials. For example, the effectiveness of an agent
determined by a screening assay as described herein to increase 31939 gene expression,
protein levels, or upregulate 31939 activity, can be monitored in clinical trials of subjects
exhibiting decreased 31939 gene expression, protein levels, or downregulated 31939
activity. Alternatively, the effectiveness of an agent determined by a screening assay to
decrease 31939 gene expression, protein levels, or downregulate 31939 activity, can be
monitored in clinical trials of subjects exhibiting increased 31939 gene expression, protein
levels, or upregulated 31939 activity. In such clinical trials, the expression or activity of a
31939 gene, and preferably, other genes that have been implicated in, for example, a 31939-
associated disorder can be used as a "read out" or markers of the phenotype of a particular
cell.

31939 Informatics 

The sequence of a 31939 molecule is provided in a variety of media to facilitate use
thereof. A sequence can be provided as a manufacture, other than an isolated nucleic acid or
amino acid molecule, which contains a 31939. Such a manufacture can provide a nucleotide
or amino acid sequence, e.g., an open reading frame, in a form which allows examination of
the manufacture using means not directly applicable to examining the nucleotide or amino
acid sequences, or a subset thereof, as they exist in nature or in purified form. The
sequence information can include, but is not limited to, 31939 full-length nucleotide and/or
amino acid sequences, partial nucleotide and/or amino acid sequences, polymorphic
sequences including single nucleotide polymorphisms (SNPs), epitope sequence, and the
like. In a preferred embodiment, the manufacture is a machine-readable medium, e.g., a
magnetic, optical, chemical or mechanical information storage device.

As used herein, "machine-readable media" refers to any medium that can be read and
accessed directly by a machine, e.g., a digital computer or analogue computer. Non-limiting
examples of a computer include a desktop PC, laptop, mainframe, server (e.g., a web server,
network server, or server farm), handheld digital assistant, pager, mobile telephone, and the
like. The computer can be stand-alone or connected to a communications network, e.g., a
local area network (such as a VPN or intranet), a wide area network (e.g., an Extranet or the
Internet), or a telephone network (e.g., a wireless, DSL, or ISDN network). Machine-
readable media include, but are not limited to: magnetic storage media, such as floppy
discs, hard disc storage medium, and magnetic tape; optical storage media such as CD-
ROM; electrical storage media such as RAM, ROM, EPROM, EEPROM, flash memory,
and the like; and hybrids of these categories such as magnetic/optical storage media.

A variety of data storage structures are available to a skilled artisan for creating a
machine-readable medium having recorded thereon a nucleotide or amino acid sequence of
the present invention. The choice of the data storage structure will generally be based on the
means chosen to access the stored information. In addition, a variety of data processor
programs and formats can be used to store the nucleotide sequence information of the
present invention on computer readable medium. The sequence information can be
represented in a word processing text file, formatted in commercially-available software
such as WordPerfect and Microsoft Word, or represented in the form of an ASCII file,
stored in a database application, such as DB2, Sybase, Oracle, or the like. The skilled
artisan can readily adapt any number of data processor structuring formats (e.g., text file or
database) in order to obtain computer readable medium having recorded thereon the
nucleotide sequence information of the present invention.

In a preferred embodiment, the sequence information is stored in a relational
database (such as Sybase or Oracle). The database can have a first table for storing
sequence (nucleic acid and/or amino acid sequence) information. The sequence information
can be stored in one field (e.g., a first column) of a table row and an identifier for the
sequence can be stored in another field (e.g., a second column) of the table row. The database
can have a second table, e.g., storing annotations. The second table can have a field for the
sequence identifier, a field for a descriptor or annotation text (e.g., the descriptor can refer to
a functionality of the sequence), a field for the initial position in the sequence to which the
annotation refers, and a field for the ultimate position in the sequence to which the
annotation refers. Non-limiting examples for annotation to nucleic acid sequences include
polymorphisms (e.g., SNPs), translational regulatory sites and splice junctions. Non-
limiting examples for annotations to amino acid sequence include polypeptide domains, e.g.,
a domain described herein; active sites and other functional amino acids; and modification
sites.
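
The two-table layout described above can be sketched with any relational database. The example below is a minimal sketch that uses SQLite through Python's standard `sqlite3` module purely for convenience (the text names Sybase and Oracle). The table names, column names, the identifier `demo_seq`, and the short placeholder sequence are all assumptions introduced for illustration and are not the 31939 sequence.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE sequences (
    residues TEXT NOT NULL,      -- first column: the sequence itself
    seq_id   TEXT PRIMARY KEY    -- second column: identifier for the sequence
);
CREATE TABLE annotations (
    seq_id     TEXT NOT NULL REFERENCES sequences(seq_id),
    descriptor TEXT NOT NULL,    -- annotation text
    start_pos  INTEGER NOT NULL, -- initial position (1-based)
    end_pos    INTEGER NOT NULL  -- ultimate position (1-based, inclusive)
);
""")

# Placeholder data only.
conn.execute("INSERT INTO sequences (residues, seq_id) VALUES (?, ?)",
             ("ATGGCCCTGAGCTAA", "demo_seq"))
conn.execute("INSERT INTO annotations VALUES (?, ?, ?, ?)",
             ("demo_seq", "hypothetical start codon", 1, 3))

def annotated_regions(connection, seq_id):
    """Return (descriptor, residues covered) for each annotation of seq_id."""
    rows = connection.execute(
        "SELECT a.descriptor, "
        "substr(s.residues, a.start_pos, a.end_pos - a.start_pos + 1) "
        "FROM annotations a JOIN sequences s ON s.seq_id = a.seq_id "
        "WHERE a.seq_id = ? ORDER BY a.start_pos",
        (seq_id,),
    )
    return rows.fetchall()

regions = annotated_regions(conn, "demo_seq")
```

Keeping annotations in their own table keyed by the sequence identifier lets any number of annotations refer to one stored sequence without duplicating it.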

By providing the nucleotide or amino acid sequences of the invention in computer
readable form, the skilled artisan can routinely access the sequence information for a variety
of purposes. For example, one skilled in the art can use the nucleotide or amino acid
sequences of the invention in computer readable form to compare a target sequence or target
structural motif with the sequence information stored within the data storage means. A
search is used to identify fragments or regions of the sequences of the invention which
match a particular target sequence or target motif. The search can be a BLAST search or
other routine sequence comparison, e.g., a search described herein.
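
A search for regions of a stored sequence that match a target motif can be as simple as a pattern scan. The sketch below is an assumption-laden illustration rather than a BLAST search: it uses Python's standard `re` module, and both the stored sequence and the motif pattern are hypothetical toy values, not a sequence or consensus taken from this specification.

```python
import re

def find_motif(sequence, motif_regex):
    """Return (1-based start, matched text) for every match, including
    overlapping ones, by wrapping the motif in a lookahead."""
    pattern = re.compile("(?=(" + motif_regex + "))")
    return [(m.start() + 1, m.group(1)) for m in pattern.finditer(sequence)]

# Toy stored sequence and toy motif ("." matches any residue).
stored = "MKLSNLTLLDLSNNRL"
hits = find_motif(stored, "L..L.L")
```

The lookahead is used so that two matches sharing residues are both reported, which a plain left-to-right scan would miss.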

Thus, in one aspect, the invention features a method of analyzing 31939, e.g.,
analyzing structure, function, or relatedness to one or more other nucleic acid or amino acid
sequences. The method includes: providing a 31939 nucleic acid or amino acid sequence;
comparing the 31939 sequence with a second sequence, e.g., one or more, preferably a
plurality of, sequences from a collection of sequences, e.g., a nucleic acid or protein
sequence database, to thereby analyze 31939. The method can be performed in a machine,
e.g., a computer, or manually by a skilled artisan.

The method can include evaluating the sequence identity between a 31939 sequence
and a database sequence. The method can be performed by accessing the database at a
second site, e.g., over the Internet.
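
Evaluating sequence identity between two already-aligned sequences can be sketched as follows. This is one of several conventions, chosen here as an assumption: identity is counted only over columns where neither sequence has a gap. The function name and the two short aligned strings are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of identical residues over the ungapped columns of a
    pairwise alignment (both inputs must have equal length)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip columns containing a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared

# Toy alignment: nine ungapped columns, eight of them identical.
identity = percent_identity("ATGGCC-TGA", "ATGACCTTGA")
```

Other definitions divide by the full alignment length or by the length of the shorter sequence, so the convention should be stated alongside any reported value.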

As used herein, a "target sequence" can be any DNA or amino acid sequence of six
or more nucleotides or two or more amino acids. A skilled artisan can readily recognize that
the longer a target sequence is, the less likely a target sequence will be present as a random
occurrence in the database. Typical sequence lengths of a target sequence are from about 10
to 100 amino acids or from about 30 to 300 nucleotide residues. However, it is well
recognized that commercially important fragments, such as sequence fragments involved in
gene expression and protein processing, may be of shorter length.

Computer software is publicly available which allows a skilled artisan to access
sequence information provided in a computer readable medium for analysis and comparison
to other sequences. A variety of known algorithms are disclosed publicly and a variety of
commercially available software for conducting search means are available and can be used
in the computer-based systems of the present invention. Examples of such software include, but
are not limited to, MacPattern (EMBL), BLASTN and BLASTX (NCBI).

Thus, the invention features a method of making a computer readable record of a
sequence of a 31939 sequence which includes recording the sequence on a computer
readable matrix. In a preferred embodiment the record includes one or more of the
following: identification of an ORF; identification of a domain, region, or site; identification
of the start of transcription; identification of the transcription terminator; the full length
amino acid sequence of the protein, or a mature form thereof; the 5' end of the translated
region.

In another aspect, the invention features a method of analyzing a sequence. The
method includes: providing a 31939 sequence, or record, in machine-readable form;
comparing a second sequence to the 31939 sequence; thereby analyzing a sequence.
Comparison can include comparing to sequences for sequence identity or determining if one
sequence is included within the other, e.g., determining if the 31939 sequence includes a
sequence being compared. In a preferred embodiment the 31939 or second sequence is
stored on a first computer, e.g., at a first site and the comparison is performed, read, or
recorded on a second computer, e.g., at a second site. E.g., the 31939 or second sequence
can be stored in a public or proprietary database in one computer, and the results of the
comparison performed, read, or recorded on a second computer. In a preferred embodiment
the record includes one or more of the following: identification of an ORF; identification of
a domain, region, or site; identification of the start of transcription; identification of the
transcription terminator; the full length amino acid sequence of the protein, or a mature form
thereof; the 5' end of the translated region.
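
One item such a record can carry is the identification of an ORF. The sketch below shows one simple way this might be computed, under stated assumptions: only the forward strand is scanned, an ORF is taken to run from an ATG to the first in-frame stop codon, and coordinates are reported 1-based and inclusive of the stop codon. The input is a hypothetical toy sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def find_orfs(dna, min_codons=2):
    """Return (start, end) coordinates of forward-strand ORFs, 1-based and
    inclusive of the stop codon, in each of the three reading frames."""
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first ATG opens a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                if (i + 3 - start) // 3 >= min_codons:
                    orfs.append((start + 1, i + 3))
                start = None  # look for the next ORF in this frame
    return orfs

# Toy sequence containing ATG GCC CTG TAA in the third reading frame.
orfs = find_orfs("CCATGGCCCTGTAAGG")
```

The resulting coordinates could be stored as the start and end fields of an annotation attached to the recorded sequence.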

In another aspect the invention provides a machine-readable medium for holding
instructions for performing a method for determining whether a subject has a 31939-
associated disease or disorder or a pre-disposition to a 31939-associated disease or disorder,
wherein the method comprises the steps of determining 31939 sequence information
associated with the subject and based on the 31939 sequence information, determining
whether the subject has a 31939-associated disease or disorder or a pre-disposition to a
31939-associated disease or disorder and/or recommending a particular treatment for the
disease, disorder or pre-disease condition.

The invention further provides in an electronic system and/or in a network, a method
for determining whether a subject has a 31939-associated disease or disorder or a pre-
disposition to a disease associated with a 31939 wherein the method comprises the steps of
determining 31939 sequence information associated with the subject, and based on the
31939 sequence information, determining whether the subject has a 31939-associated
disease or disorder or a pre-disposition to a 31939-associated disease or disorder, and/or
recommending a particular treatment for the disease, disorder or pre-disease condition. In a
preferred embodiment, the method further includes the step of receiving information, e.g.,
phenotypic or genotypic information, associated with the subject and/or acquiring from a
network phenotypic information associated with the subject. The information can be stored
in a database, e.g., a relational database. In another embodiment, the method further
includes accessing the database, e.g., for records relating to other subjects, comparing the
31939 sequence of the subject to the 31939 sequences in the database to thereby determine
whether the subject has a 31939-associated disease or disorder, or a pre-disposition for such.

The present invention also provides in a network, a method for determining whether
a subject has a 31939-associated disease or disorder or a pre-disposition to a 31939-
associated disease or disorder associated with 31939, said method comprising the steps of
receiving 31939 sequence information from the subject and/or information related thereto,
receiving phenotypic information associated with the subject, acquiring information from
the network corresponding to 31939 and/or corresponding to a 31939-associated disease or
disorder (e.g., a cell proliferative or differentiative disorder, e.g., cancer, or a neuronal
disorder), and based on one or more of the phenotypic information, the 31939 information
(e.g., sequence information and/or information related thereto), and the acquired
information, determining whether the subject has a 31939-associated disease or disorder or a
pre-disposition to a 31939-associated disease or disorder. The method may further comprise
the step of recommending a particular treatment for the disease, disorder or pre-disease
condition.

The present invention also provides a method for determining whether a subject has
a 31939-associated disease or disorder or a pre-disposition to a 31939-associated disease or
disorder, said method comprising the steps of receiving information related to 31939 (e.g.,
sequence information and/or information related thereto), receiving phenotypic information
associated with the subject, acquiring information from the network related to 31939 and/or
related to a 31939-associated disease or disorder, and based on one or more of the
phenotypic information, the 31939 information, and the acquired information, determining
whether the subject has a 31939-associated disease or disorder or a pre-disposition to a
31939-associated disease or disorder. The method may further comprise the step of
recommending a particular treatment for the disease, disorder or pre-disease condition.

This invention is further illustrated by the following examples that should not be
construed as limiting. The contents of all references, patents and published patent
applications cited throughout this application are incorporated herein by reference.

EXAMPLES 

Example 1: Identification and Characterization of Human 31939 cDNA

The human 31939 sequence (Fig. 1; SEQ ID NO:1), which is approximately 2493
nucleotides long, including untranslated regions, contains a predicted methionine-initiated
coding sequence of about 2142 nucleotides, including the termination codon (nucleotides
indicated as "coding" of SEQ ID NO:1 in Fig. 1; SEQ ID NO:3). The coding sequence
encodes a 713 amino acid protein (SEQ ID NO:2).
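
The lengths stated in Example 1 are mutually consistent, as the short check below illustrates. It takes the approximate figures from the text at face value (2493 nucleotides in total, 2142 coding nucleotides including the termination codon); the variable names are introduced here for illustration only.

```python
TOTAL_NT = 2493    # approximate full-length cDNA, including untranslated regions
CODING_NT = 2142   # approximate coding sequence, including the termination codon

codons, remainder = divmod(CODING_NT, 3)
amino_acids = codons - 1              # the termination codon encodes no residue
untranslated_nt = TOTAL_NT - CODING_NT
```

That is, 2142 nucleotides make up 714 complete codons, one of which is the termination codon, leaving 713 amino acids and about 351 nucleotides of untranslated sequence in total.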

Example 2: Tissue Distribution of 31939 mRNA by Northern Analysis

Northern blot hybridizations with various RNA samples can be performed under
standard conditions and washed under stringent conditions, i.e., 0.2xSSC at 65°C. A DNA
probe corresponding to all or a portion of the 31939 cDNA (SEQ ID NO:1) can be used.
The DNA was radioactively labeled with 32P-dCTP using the Prime-It Kit (Stratagene, La
Jolla, CA) according to the instructions of the supplier. Filters containing mRNA from
mouse hematopoietic and endocrine tissues, and cancer cell lines (Clontech, Palo Alto, CA)
can be probed in ExpressHyb hybridization solution (Clontech) and washed at high
stringency according to manufacturer's recommendations.

Example 3: Recombinant Expression of 31939 in Bacterial Cells

In this example, 31939 is expressed as a recombinant glutathione-S-transferase
(GST) fusion polypeptide in E. coli and the fusion polypeptide is isolated and characterized.
Specifically, 31939 is fused to GST and this fusion polypeptide is expressed in E. coli, e.g.,
strain PEB199. Expression of the GST-31939 fusion protein in PEB199 is induced with
IPTG. The recombinant fusion polypeptide is purified from crude bacterial lysates of the
induced PEB199 strain by affinity chromatography on glutathione beads. Using
polyacrylamide gel electrophoretic analysis of the polypeptide purified from the bacterial
lysates, the molecular weight of the resultant fusion polypeptide is determined.

Example 4: Expression of Recombinant 31939 Protein in COS Cells

To express the 31939 gene in COS cells, the pcDNA/Amp vector by Invitrogen
Corporation (San Diego, CA) is used. This vector contains an SV40 origin of replication, an
ampicillin resistance gene, an E. coli replication origin, a CMV promoter followed by a
polylinker region, and an SV40 intron and polyadenylation site. A DNA fragment encoding
the entire 31939 protein and an HA tag (Wilson et al. (1984) Cell 37:767) or a FLAG tag
fused in-frame to the 3' end of the fragment is cloned into the polylinker region of the vector,
thereby placing the expression of the recombinant protein under the control of the CMV
promoter.

To construct the plasmid, the 31939 DNA sequence is amplified by PCR using two
primers. The 5' primer contains the restriction site of interest followed by approximately
twenty nucleotides of the 31939 coding sequence starting from the initiation codon; the 3'
end sequence contains complementary sequences to the other restriction site of interest, a
translation stop codon, the HA tag or FLAG tag and the last 20 nucleotides of the 31939
coding sequence. The PCR amplified fragment and the pcDNA/Amp vector are digested
with the appropriate restriction enzymes and the vector is dephosphorylated using the CIAP
enzyme (New England Biolabs, Beverly, MA). Preferably the two restriction sites chosen
are different so that the 31939 gene is inserted in the correct orientation. The ligation
mixture is transformed into E. coli cells (strains HB101, DH5α, SURE, available from
Stratagene Cloning Systems, La Jolla, CA, can be used), the transformed culture is plated on
ampicillin media plates, and resistant colonies are selected. Plasmid DNA is isolated from
transformants and examined by restriction analysis for the presence of the correct fragment.
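
The primer layout described in this paragraph can be sketched in Python. This is a minimal illustration under stated assumptions, not the primers actually used: the coding sequence is an arbitrary placeholder (not the 31939 sequence), the EcoRI and XhoI sites stand in for the unspecified "restriction sites of interest", the HA tag is written with one common codon choice for YPYDVPDYA, and the function and variable names are hypothetical. The coding sequence passed in is assumed to exclude its own stop codon so that the tag is fused in-frame.

```python
# Hypothetical sketch of the primer layout; all sequences are placeholders.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def design_primers(cds, site_5, site_3, tag, stop="TAA", anneal_len=20):
    """Build the 5' and 3' primers in the order given in the text.

    5' primer: restriction site + first `anneal_len` nt from the initiation codon.
    3' primer: complement of (last `anneal_len` nt + tag + stop codon + site),
    written 5'->3', so it reads site, stop, tag, coding sequence.
    """
    cds = cds.upper()
    if not cds.startswith("ATG"):
        raise ValueError("coding sequence must start at the initiation codon")
    if len(cds) < anneal_len or len(cds) % 3 != 0:
        raise ValueError("coding sequence too short or not a whole number of codons")
    for site in (site_5, site_3):
        if site in cds:
            raise ValueError(f"site {site} also occurs inside the coding sequence")
    forward = site_5 + cds[:anneal_len]
    sense_tail = cds[-anneal_len:] + tag + stop + site_3
    return forward, reverse_complement(sense_tail)


# Assumed example inputs (not from the specification).
TOY_CDS = "ATGAAAGATCTGACCGAAGTGCAGCTGGTTCCGAACGGCACCTACTGG"
ECORI, XHOI = "GAATTC", "CTCGAG"
HA_TAG = "TACCCATACGATGTTCCAGATTACGCT"  # encodes YPYDVPDYA

forward, reverse = design_primers(TOY_CDS, ECORI, XHOI, HA_TAG)
print("5' primer:", forward)
print("3' primer:", reverse)
```

Using two different sites, as the text prefers, fixes the orientation of the insert; the sketch also rejects a site that occurs inside the coding sequence, since digestion would then cut the insert itself.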

COS cells are subsequently transfected with the 31939-pcDNA/Amp plasmid DNA
using the calcium phosphate or calcium chloride co-precipitation methods,
DEAE-dextran-mediated transfection, lipofection, or electroporation. Other suitable methods for
transfecting host cells can be found in Sambrook, J., Fritsch, E. F., and Maniatis, T. (1989)
Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory, Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, NY. The expression of the 31939
polypeptide is detected by radiolabelling (35S-methionine or 35S-cysteine, available from
NEN, Boston, MA, can be used) and immunoprecipitation (Harlow, E. and Lane, D. (1988)
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, NY) using an HA specific monoclonal antibody. Briefly, the cells are labeled for 8
hours with 35S-methionine (or 35S-cysteine). The culture media are then collected and the
cells are lysed using detergents (RIPA buffer, 150 mM NaCl, 1% NP-40, 0.1% SDS, 0.5%
DOC, 50 mM Tris, pH 7.5). Both the cell lysate and the culture media are precipitated with
an HA specific monoclonal antibody. Precipitated polypeptides are then analyzed by
SDS-PAGE.
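
As a worked example of the RIPA buffer composition given above, the following Python sketch computes reagent amounts for a chosen volume. It rests on assumptions not stated in the text: NP-40 is taken as percent volume/volume, SDS and DOC (deoxycholate) as percent weight/volume, Tris is weighed as Tris base (121.14 g/mol) with the pH adjusted to 7.5 separately, and NaCl is 58.44 g/mol. The function name is hypothetical.

```python
# Assumed formula weights in g/mol (standard values, not from the specification).
FW_NACL = 58.44
FW_TRIS_BASE = 121.14


def ripa_amounts(volume_ml: float) -> dict:
    """Reagent amounts for RIPA buffer: 150 mM NaCl, 1% NP-40, 0.1% SDS,
    0.5% DOC, 50 mM Tris. Percentages: NP-40 v/v, SDS and DOC w/v (assumed)."""
    litres = volume_ml / 1000.0
    return {
        "NaCl_g": 0.150 * litres * FW_NACL,       # mol/L x L x g/mol
        "Tris_g": 0.050 * litres * FW_TRIS_BASE,  # adjust to pH 7.5 afterwards
        "NP40_ml": 0.01 * volume_ml,              # 1% v/v
        "SDS_g": 0.001 * volume_ml,               # 0.1 g per 100 mL
        "DOC_g": 0.005 * volume_ml,               # 0.5 g per 100 mL
    }


amounts = ripa_amounts(100.0)
for reagent, amount in amounts.items():
    print(f"{reagent}: {amount:.4f}")
```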

Alternatively, DNA containing the 31939 coding sequence is cloned directly into the
polylinker of the pcDNA/Amp vector using the appropriate restriction sites. The resulting
plasmid is transfected into COS cells in the manner described above, and the expression of
the 31939 polypeptide is detected by radiolabelling and immunoprecipitation using a 31939
specific monoclonal antibody.

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than
routine experimentation, many equivalents to the specific embodiments of the invention
described herein. Such equivalents are intended to be encompassed by the following claims.

What is claimed is:

1. An isolated nucleic acid molecule selected from the group consisting of:
a) a nucleic acid molecule comprising a nucleotide sequence which is at least
80% identical to the nucleotide sequence of SEQ ID NO:1, or SEQ ID NO:3;
b) a nucleic acid molecule comprising a fragment of at least 1000 nucleotides of
the nucleotide sequence of SEQ ID NO:1, or SEQ ID NO:3;
c) a nucleic acid molecule which encodes a polypeptide comprising the amino
acid sequence of SEQ ID NO:2;
d) a nucleic acid molecule which encodes a fragment of a polypeptide
comprising the amino acid sequence of SEQ ID NO:2, wherein the fragment comprises at
least 15 contiguous amino acids of SEQ ID NO:2; and
e) a nucleic acid molecule which encodes a naturally occurring allelic variant of
a polypeptide comprising the amino acid sequence of SEQ ID NO:2, wherein the nucleic
acid molecule hybridizes to a nucleic acid molecule comprising SEQ ID NO:1, 3, or a
complement thereof, under stringent conditions.

2. The isolated nucleic acid molecule of claim 1, which is selected from the
group consisting of:
a) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:1, SEQ ID
NO:3; and
b) a nucleic acid molecule which encodes a polypeptide comprising the amino
acid sequence of SEQ ID NO:2.

3. The nucleic acid molecule of claim 1 further comprising a vector nucleic acid
sequence.

4. The nucleic acid molecule of claim 1 further comprising a nucleic acid
sequence encoding a heterologous polypeptide.

5. A host cell which contains the nucleic acid molecule of claim 1.

6. The host cell of claim 5 which is a mammalian host cell.

7. A non-human mammalian host cell containing the nucleic acid molecule of
claim 1.

8. An isolated polypeptide selected from the group consisting of:
a) a polypeptide which is encoded by a nucleic acid molecule comprising a
nucleotide sequence which is at least 80% identical to a nucleic acid comprising the
nucleotide sequence of SEQ ID NO:1 or SEQ ID NO:3;
b) a naturally occurring allelic variant of a polypeptide comprising the amino
acid sequence of SEQ ID NO:2, wherein the polypeptide is encoded by a nucleic acid
molecule which hybridizes to a nucleic acid molecule comprising SEQ ID NO:1, SEQ ID
NO:3, or a complement thereof under stringent conditions; and
c) a fragment of a polypeptide comprising the amino acid sequence of SEQ ID
NO:2, wherein the fragment comprises at least 15 contiguous amino acids of SEQ ID NO:2.

9. The isolated polypeptide of claim 8 comprising the amino acid sequence of
SEQ ID NO:2.

10. The polypeptide of claim 8 further comprising a heterologous amino acid
sequence.

11. An antibody which selectively binds to a polypeptide of claim 8.

12. A method for producing a polypeptide selected from the group consisting of:
a) a polypeptide comprising the amino acid sequence of SEQ ID NO:2;
b) a polypeptide comprising a fragment of the amino acid sequence of SEQ ID
NO:2, wherein the fragment comprises at least 15 contiguous amino acids of SEQ ID NO:2;
and
c) a naturally occurring allelic variant of a polypeptide comprising the amino
acid sequence of SEQ ID NO:2, wherein the polypeptide is encoded by a nucleic acid
molecule which hybridizes to a nucleic acid molecule comprising SEQ ID NO:1, SEQ ID
NO:3, or a complement thereof under stringent conditions;
the method comprising culturing the host cell of claim 5 under conditions in which
the nucleic acid molecule is expressed.

13. A method for detecting the presence of a polypeptide of claim 8 in a sample,
comprising:
a) contacting the sample with an agent which selectively binds to a polypeptide
of claim 8; and
b) determining whether the agent binds to the polypeptide in the sample.

14. The method of claim 13, wherein the agent which binds to the polypeptide is
an antibody.

15. A kit comprising an agent which selectively binds to a polypeptide of claim 8
and instructions for use.

16. A method for detecting the presence of a nucleic acid molecule of claim 1 in
a sample, comprising the steps of:
a) contacting the sample with a nucleic acid probe or primer which selectively
hybridizes to the nucleic acid molecule; and
b) determining whether the nucleic acid probe or primer binds to a nucleic acid
molecule in the sample.

17. The method of claim 16, wherein the sample comprises mRNA molecules
and is contacted with a nucleic acid probe.

18. A kit comprising an agent which selectively hybridizes to a nucleic acid
molecule of claim 1 and instructions for use.

19. A method for identifying an agent which binds to a polypeptide of claim 8
comprising the steps of:
a) contacting a polypeptide, or a cell expressing a polypeptide of claim 8 with a
test agent; and
b) determining whether the polypeptide binds to the test agent.

20. A method for modulating the activity of a polypeptide of claim 8 comprising
contacting a polypeptide or a cell expressing a polypeptide of claim 8 with an agent which
binds to the polypeptide in a sufficient concentration to modulate the activity of the
polypeptide.

21. A method for identifying an agent which modulates the activity of a
polypeptide of claim 8, comprising:
a) contacting a polypeptide of claim 8 with a test agent; and
b) determining the effect of the test agent on the activity of the polypeptide to
thereby identify an agent which modulates the activity of the polypeptide.

Figure 1A

[Nucleotide sequence of 31939 with the encoded amino acid sequence, through nucleotide 996; the sequence text of this page is not recoverable.]

NTTCCARCHAPA GLKGRYIG 352 

AAC ACG ACG TGC TGC GCC CGC TGT CAT GCG CCC GCC GGC CTC AAG GG6 CGC TAC ATT G6G 1056 

ELDQSHPTCYAPVIVE PPTD 372 

GAS CT6 6AC CAS TC6 CAT TTC ACC TGC TAT GCG CCC GTC ATC GTG GAS COG CCC ACG GAC 1116 

LN VTBGMAAELKCRTGT SHT 392 

CTC AAC GTC ACC GAG GGC AT6 GOT GCC GAG CTC AAA TGC CGC ACG GGC ACC TGC ATG ACC 1176 

SVNWX.TPNGTZikTH6SYRV R 412 

TCC GTC AAC TG6 CT6 ACG CCC AAC GGC ACC CTC ATG ACC CAC GGC TCC TAC CGC GTG CGC 1236 

ISV LHDGTLMPT HVTVQ DT G 432 

ATC TCC GTC CTG CAT GAC GGC ACG CTT AAC TTC ACC AAC GTC ACC GTG CAS GAC ACG |GOC 1296 

QyTCMVTNSAGMTTASATLN 452 

CAG TAC ACG TGC ATG GTG ACG AAC TCA GCC GGC AAC ACC ACC GCC TCG GCC ACG CTC AAC 1356 

VS AVD PVAAG G T G SGGG 6 P G 472 

GTC TCG GCC GTG GAC CCC GTG GCG GCC GG6 GGC ACC GGC AGC GGC G6G GGC GGC CCT GG6 1416 

GSGGVGGGSGGYTYFTTVTV 492 

GGC AGT GGT GGT GTT GGA GGG GGC AGT GGC GGC TAC ACC TAC TTC ACC ACG GTG ACC GTG 1476 

ETLBTQPGEEALQPRGTBKE 512 

GAG ACC CTG GAS ACG CAS CCC GGA GAG GAG GCC CTG CAS CCG CGG GGG ACG GAG AAG GAA 1536 

PPGP TTD GVWGGGRPGDAA6 532 

CCG CCA GGG CCC ACG ACA GAC GGT GIC TGG GGT GGG GGC CGG CCT GGG GAC GCG GCC GGC 1596 

PASSSTTAPAP RS5RPTBKA 552 

OCT GCC TCG TCT TCT ACC ACG GCA CCC GCC CCG CGC TCC TCG CGG CCC ACG GAG AAG GCG 1656 

FTVPITDVTENALKOLDDVM 572 

TTC ACG GTG CCC ATC ACG GAT GTG ACG GAG AAC GCC CTC AAG GAC CTG GAC GAC GTC ATG 1716 

KTTKIIIGCPVAITPMAAVM 592 

AAG ACC ACC AAA ATC ATC ATC GGC TGC TTC GTG GCC ATC ACG TTC ATG GCC GCG GTG ATG 1776 

I,VAFyKI.RKQHQI*MKHHGPT 612 

CTC GTG GCC TTC TAC AAG CTG CGC AAG CAG CAC CAG CTC CAC AAG CAC CAC GGG CCC A06 1836 

RTVEIINVBDBIfPAASAVSV 632 

CGC ACC GTG GAG ATC ATC AAC GTG GAG GAC GAG CTG CCC GCC GCC TCG GCC GTG TCC GTG 1896 

AAAAAVASGGGVGGDSHItA L 652 

GCC GCC GCG GCC GCC GTG GCC AGT GGG GGT GGT GTG GGC GGG GAC AGC CAC CTG OCC CTG 1956 

PALERDHLNHHHYVADA FKA 672 

CCC GCC CTG GTkG CGA GAC CAC CTC AAC CAC CAC CAC TAC GTG GOT GAC GCC TTC AAG GCG 2016 

HYSSNPSGGGCGGKGPPGLN 692 

CAC TAC AGC AGC AAC CCC AGO GGC GGG GGC TGC GGG GGC AAA GGC CCG CCT GGC CTC AAC 2076 

SXHBPLLFKSGSRBNVQETQ 712 

TCC ATC CAC GAA CCT CTG CTC TTC AAG AGC GGC TCC AAG GAG AAC GTG CAA GAG ACG CAG 2136 

LRRNT domain 1 of 1, from 56 to 85: score 33.5, E = 4.8e-26
LRR domain 1 of 9, from 87 to 110: score 16.1, E = 0.83
LRR domain 2 of 9, from 111 to 134: score [illegible], E = 0.093
LRR domain 3 of 9, from 135 to 158: score [illegible], E = [illegible]
LRR domain 4 of 9, from 159 to 182: score 25.[illegible], E = 0.0016
LRR domain 5 of 9, from 183 to 207: score 11.1, E = 23
LRR domain 6 of 9, from 208 to 229: score 10.[illegible], E = 31
LRR domain 7 of 9, from 230 to 253: score 21.8, E = 0.016
LRR domain 8 of 9, from 254 to 277: score 9.8, E = 37
LRR domain 9 of 9, from 278 to 301: score 24.9, E = 0.0019
LRRCT domain 1 of 1, from 311 to 362: score 29.8, E = 6.2e-05

[The alignment rows beneath each domain header are not recoverable.]
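
The nine repeat-domain headers above give start and end coordinates in the 31939 polypeptide. As a small check, assuming those coordinates are read correctly from the headers (87-110 through 278-301), the Python sketch below computes each repeat length and tests whether the repeats are arranged in tandem without gaps. The variable names are hypothetical.

```python
# Start/end residues of the nine repeat hits, as read from the domain headers.
lrr_units = [
    (87, 110), (111, 134), (135, 158), (159, 182), (183, 207),
    (208, 229), (230, 253), (254, 277), (278, 301),
]

# Inclusive length of each repeat.
lengths = [end - start + 1 for start, end in lrr_units]

# Tandem arrangement: each repeat starts one residue after the previous ends.
contiguous = all(
    nxt[0] == prev[1] + 1 for prev, nxt in zip(lrr_units, lrr_units[1:])
)

print("lengths:", lengths)
print("contiguous:", contiguous)
print("total span:", lrr_units[-1][1] - lrr_units[0][0] + 1)
```

Under these coordinates every repeat is 22 to 25 residues long, which falls within the 20 to 29 residue range described in the Background for leucine-rich repeats.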

Figure 4A

[Domain name illegible] domain 1 of 1, from 378 to 438: score 28.9, E = 2.2e-07

[The alignment rows are not recoverable.]

Figure 4B

[Domain name illegible] domain 1 of 1, from 380 to 438: score 11.8, E = 0.032
[Domain name illegible] domain 1 of 1, from 376 to 443: score 46.6, E = 5.6e-10
[Domain name illegible] domain 1 of 1, from 370 to 454: score 53.4, E = 4.8e-12

[The alignment rows are not recoverable.]